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Description 

FIELD OF THE INVENTION 



ThA nresent invention relates Generally to the use of certain libraries of synthetic compounds to prepare drugs, 
diagnostic reagents, pesticides or herbicides, which libraries can be made by stochastic methods for synthesizing ran- 
dom oligomers, with particular emphasis on particle-based synthesis methods. The invention involves the use of iden- 
tification tags on the particles to facilitate identification of the oligomer sequence synthesized. 

BACKGROUND OF THE INVENTION 

The relationship between structure and activity of molecules is a fundamental issue in the study of biological sys- 
tems Structure-activity relationships are important in understanding, for example, the function of enzymes, the ways in 
which cells communicate with each other, and cellular control and feedback systems. Certain macromolecules are 
known to interact and bind to other molecules having a very specific three-dimensional spatial and electronic distribu- 
tion. Any large molecule having such specificity can be considered a receptor, whether the molecule is an enzyme cat- 
alyzing hydrolysis of a metabolic intermediate, a cell-surface protein mediating membrane transport of ions, a 
glycoprotein serving to identify a particular cell to its neighbors, an IgG-class antibody circulating in the plasma, an oli- 
gonucleotide sequence of DNA in the genome, or the like. The various molecules that receptors selectively bind are 

known as ligands. ^ . . , m3t . 

Many assays are available for measuring the binding affinity of known receptors and ligands, but the information 
that can be gained from such experiments is often limited by the number and type of available ligands. Novel ligands 
are sometimes discovered by chance or by application of new techniques for the elucidation of molecular structure, 
including x-ray crystallographic analysis and recombinant genetic techniques for proteins. 

Small peptides are an exemplary system for exploring the relationship between structure and function in biology. A 
peptide is a polymer composed of amino acid monomers. When the twenty naturally occurring amino acids are con- 
densed into polymeric molecules, the resulting polymers form a wide variety of three-dimensional configurations, each 
resulting from a particular amino acid sequence and solvent condition. The number of possible pentapeptides of the 20 
naturally occurring amino acids, for example, is 20 5 or 3.2 million different peptides. The likelihood that molecules of this 
size might be useful in receptor-binding studies is supported by epitope analysis studies showing that some antibodies 
recognize sequences as short as a few amino acids with high specificity. Furthermore, the average molecular weight of 
amino acids puts small peptides in the size range of many currently useful pharmaceutical products. Of course, larger 
peptides may be necessary for many purposes, and polypeptides having changes in only a small number of residues 
may also be useful for such purposes as the analysis of structure-activity relationships. 

Pharmaceutical drug discovery is one type of research that relies on studies of structure-activity relationships. In 
most cases contemporary pharmaceutical research can be described as the process of discovering novel ligands with 
desirable patterns of specificity for biologically important receptors. Another example is research to discover new com- 
pounds for use in agriculture, such as pesticides and herbicides. 

Prior methods of preparing large numbers of different oligomers have been painstakingly slow when used at a scale 
sufficient to permit effective rational or random screening. For example, the "Merrif ield" method Merrifield, J, Am, Chem, 
Sqc 85 21 49-21 54(1 963), has been used to synthesize peptides on a solid support. In the Merrifield method, an ammo 
acid is covalently bonded to a support made of an insoluble polymer. Another amino acid with an alpha protected group 
is reacted with the covalently bonded amino acid to form a dipeptide. The protective group is removed, and a third 
amino acid with an alpha protective group is added to the dipeptide. This process is continued until a peptide of a 
desired length and sequence is obtained. Using the Merrifield method, one cannot economically and practically synthe- 
size more than a few peptide sequences in a day. 

To synthesize larger numbers of oligomer sequences, others have proposed the use of a series of reaction vessels 
for oligomer synthesis. For example, a tubular reactor system may be used to synthesize a linear oligomer on a solid 
phase support by automated sequential addition of reagents. This method still does not enable the synthesis of a suf- 
ficiently large number of oligomer sequences for effective economical screening. 

Methods of preparing a plurality of oligomer sequences are also known in which a fbraminous container encloses 
a known quantity of reactive solid supports, the solid supports being larger in size than openings of the container. See 
U S Patent No 4 631.211 The containers may be selectively reacted with desired materials to synthesize desired 
sequences of product molecules. As with other methods known in the art. this method cannot practically be used to syn- 
thesize a sufficient variety of polypeptides for effective screening. 

Other techniques have also been described. One bead-based method is described in PCT patent publication No. 
92/00091 . These methods include the synthesis of peptides on 96 plastic pins that f it the format of standard microliter 
plates See PCT patent publications 84/03564; 86/00991; and 86/06487 Unfortunately, while these techniques have 
been somewhat useful, substantial problems remain. For example, these methods continue to be limited in the diversity 
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of sequences which can be economically synthesized and screened. 

Others have developed recombinant methods for preparing collections of oligomers. See PCT patent publication 
Nos. 91/17271 and 91/19818. 

In another important development, scientists combined the techniques of photolithography, chemistry, and biology 
5 to create large collections of oligomers and other compounds on the surface of a substrate. See US. Patent No. 
5,143,854 and PCT patent publication Nos. 90/15070 and 92/10092 

In the recombinant and VLSI PS™ combinatorial methods, one can uniquely identify each oligomer in the library by 
determining the coding sequences in the recombinant organism or phage or by the location of the oligomer on the 
VLSI PS™ chip. In other methods, however, the identity of a particular oligomer may be difficult to ascertain. What is 
10 needed in these latter methods is an efficient and simple-to-use method for tagging each particle. Although tagging 
methods have been developed for large objects, see PCT patent publication Nos. 90/14441 and 87/06383, such meth- 
ods are still needed for combinatorial libraries of oligomers. 

From the above, one can recognize that improved methods for synthesizing a diverse collection of chemical 
sequences would be beneficial so as to permit drug, diagnostic reagent, pesticide or herbicide discovery using the 
is resulting diverse collections. 

SUMMARY OF THE INVENTION 

The invention provides a process for preparing a new pharmaceutical drug or diagnostic reagent, which includes 
20 the step of screening against a ligand or receptor a library of different synthetic compounds, which compounds are 
obtainable by synthesis in a component by component fashion which links each compound to one or more identifier 
tags which enable subsequent identification of reactions through which said components were incorporated and con- 
sequent deductive structural identification of said members. 

In another aspect the invention provides a process for preparing a new pharmaceutical drug or diagnostic reagent, 
25 which includes the step of screening against a ligand or receptor a tagged synthetic oligomer library produced by syn- 
thesizing on each of a plurality of solid supports a single oligomer sequence and one or more identifier tags identifying 
said oligomer sequence, said oligomer sequence and identifier tags synthesized in a process comprising the steps of: 

(a) apportioning said supports among a plurality of reaction vessels; 
30 (b) exposing said supports in each reaction vessel to a first oligomer monomer and to a first identifier tag; 

(c) pooling said supports; 

(d) apportioning said supports among a plurality of reaction vessels; 

(e) exposing said supports to a second oligomer monomer and to a second identifier tag monomer; and 

(f) repeating steps (a) through (e) from at least one to twenty times. 

35 

Yet a further aspect is the use of a solid support in pharmaceutical drug or diagnostic reagent identification, said 
solid support comprising a first particle attached to a second particle, said first particle linked to an oligomer and said 
second particle linked to an oligonucleotide identifier tag. and wherein said oligomer is other than an oligonucleotide. 

Another aspect is a process for preparing a new pharmaceutical drug or diagnostic reagent, which includes the 
40 step of screening against a ligand or receptor an oligomer library which is obtainable by a process comprising: 

In yet further aspects, the invention provides processes or a use as above but modified such that the purpose is to 
produce a pesticide or herbicide. 

The present invention utilises a general stochastic method for synthesizing libraries of random oligomers. The ran- 
dom oligomers are synthesized on solid supports, or particles, but may be cleaved from these supports to provide a sol- 
45 uble library. The oligomers are composed of a sequence of monomers, the monomers being any member of the set of 
molecules that can be joined together to form an oligomer or polymer, i.e., amino acids, carbamates, sulfones. sulfox- 
ides, nucleosides, carbohydrates, ureas, phosphonates, lipids, esters, combinations of the same, and the like. The 
Iforary is then screened to isolate individual oligomers that bind to a receptor or possess some desired property. Each 
oligomer sequence in the library is unique, in a preferred embodiment. In another preferred embodiment, the solid sup- 
so ports are nonporous beads. The solid supports may be composed of a single particle, or two or more linked particles. 

The invention involves the use of an identifier tag to identify the sequence of monomers in the oligomer. The iden- 
tifier tag. which may be attached directly to the oligomer with or without an accompanying particle, to a linker attached 
to the oligomer, to the solid support upon which the oligomer is synthesized, or to a second particle attached to the oli- 
gomer-carrying particle, may be any recognizable feature that in some way carries the required information, and that is 
55 decipherable at the level of one or a few solid supports. The solid supports may be joined to the oligomers and the iden- 
tifier tag by means of one or more linker molecules. 

In a preferred embodiment, the identifier tag will be an oligonucleotide, preferably composed of pyrimidines or pyri- 
midines and purine analogs or any type of nucleoside that will not degrade under the coupling conditions used to 
assemble the oligomer library. The oligonucleotide identifier tag may contain a 5* and a 3' amplification site, to allow 
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• / ^ Patent Nos 4.683,202; and 4,965.1 88. 
«ni« thP nolvmerase chain reaction (see US raiem . included «n 

. — : M ♦fio art 



to 



known in the art. 

BRIEF DESCRIPTION OF THE FIGURES 
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BRIEFDESCHiri.^^. oarticles 

Fig . n is a schema* repress^ of round 

Fig. 2 is a schematic J^t-wW^ fJ^ESSSSI the compatible cherrustnesforp^^es 

Fig. 3 describes one method ot including synthesis of am»no-f unrtKmal^ed _ ^ o|i 

gooucleobde synthesis. N^isopropyl phosphoramidite) nudeosrie derwat-vestor p ^ 

Rg .6illustrates5--DMT-3-(0-aHy.N.N< 1 u S opopy ^ 

<*» des - onoi assembly of oligonucleotide-tagged peptides ™J^ S \ ^ populations of oli- 

numbers of beads. Non-flu(«scenny smaller peak is 1 5:1 . obtained by ampli- 

resentedbypeakB.Thera*oof^^^ as 
Fig. 11 shows pictures of ethri.um txom^ |rfjcatton of the tags on the sorted i i . bead 

nation after FACS of two Jhlor« ^orescent beads: lane 1 - _ ^ 1 p ° CR C ^ od ^ from ten 

described in Example 5. Gel A shows 'esuns fluore scent beads; '^.f^-^xV copies (100 

equivalents) of 95 mer tag lanes ^ "^^^ hun dred fluorescent L 1 - 1 -2x10* copies 

fluorescent beads; lanes 1 1 -13 - P ^ me result wfth sorted non-f luorescent b ead * » ^ ^ 

bead equivalents) o"^ 0 ^ ^^sing.e non-fluorescent J^J^. Jfi™ u - 2.4x10* copies 
of 110 mer tag: lanes 2-7 - PCR proa ^ nundr ed ^-''^f 0 ^^ -ize standards; lanes 2. 3 - no tag 

fluorescem beads; Ianes^-13 --PCR^ tenes 112 - DNA s«e ^ « 9 5 

of 95 mer tag. Gel C shows the < esuj wrth ^ ^ g5 t ,anes 6 7 ; ; 0 b ^ ujva|ents of so.uble 1 1 0 
ri; e S, ; s^ equSnTolso^ H0 mertag. and lanes 10, 



mer tag; lanes 8,9 
45 mer tag. 
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DESCRIPTION OF THE SPECIFIC EMBODIMENTS larQe synthe tic oligomer libraries. In a pre- 
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nucleotides are, generally, A and T (or A and U). and C and G, as is well known to those of skill in the art. Two single 
stranded RNA or DNA molecules are said to be "substantially complementary" when the nucleotides of one strand, opti- 
mally aligned, pair with at least about 80% of the nucleotides of the other strand. 

Alternatively, substantial complementarity exists when an RNA or DNA strand will hybridize under selective hybrid- 

5 ization conditions to a complementary nucleic acid. Typically, selective hybridization will occur when there is at least 
about 55% complementarity over a stretch of at least 14 to 25 nucleotides, but more selective hybridization will occur 
as complementarity increases to 65%, 75%, 90%, and 100%. See Kanehisa. Nucleic Acids Res. 12:203 (1984) 

Stringent hybridization conditions will typically include salt concentrations of less than about 1 M, such as less than 
500 mM, and will often include salt concentrations of less than 200 mM. The hybridization temperature for oligomers 

w will typically be greater than 22°C. such as greater than about 30°C, and will often be in excess of about 37°C. Longer 
fragments may require higher hybridization temperatures for specific hybridization. As other factors may dramatically 
affect the stringency of hybridization (such factors include base composition, length of the complementary strands, 
presence of organic solvents, and extent of base mismatching), the combination of factors is more important than the 
absolute measure of any one factor alone. 

is Epitope : The portion of an antigen molecule delineated by the area of interaction with the subclass of receptors 
known as antibodies is an "epitope." 

Identifier tag : An "identifier tag" is a physical attribute that provides a means whereby one can identify which mon- 
omer reactions an individual solid support has experienced in the synthesis of an oligomer. The identifier tag also 
records the step in the synthesis series in which the solid support visited that monomer reaction. The identifier tag may 

20 be any recognizable feature, including for example: a microscopically distinguishable shape, size, color, optical density, 
etc.; a differential absorbance or emission of light; chemically reactivity; magnetic or electronic encoded information; or 
any other distinctive mark with the required information, and decipherable at the level of one (or a few) solid support(s). 
A preferred example of such an identifier tag is an oligonucleotide sequence. An "identifier tag" can be coupled directly 
to the oligomer synthesized, whether or not a solid support is used in the synthesis. In thin latter embodiment, the iden- 

25 tifier tag serves as the "support" for oligomer synthesis. 

Ligand: A "ligand" is a molecule that is recognized by a particular receptor. The agent bound by or reacting with a 
receptor is called a "ligand", a term which is def initionally meaningful only in terms of its counterpart receptor. The term 
"ligand" does not imply any particular molecular size or other structural or compositional feature other than that the sub- 
stance in question is capable of binding or otherwise interacting with the receptor. Also, a "ligand" may serve either as 

30 the natural ligand to which the receptor binds, or as a functional analogue that may act as an agonist or antagonist. Lig- 
ands that can be investigated by this invention include, but are not restricted to, agonists and antagonists for cell mem- 
brane receptors, toxins and venoms, viral epitopes, hormones, sugars, cofactors, peptides, enzyme substrates, drugs 
(e.g., opiates, steroids, etc.), and proteins. 

Monomer : A "monomer" is any member of the set of molecules which can be joined together to form an oligomer 

35 or polymer. The set of monomers useful in the present invention includes, but is not restricted to, for the example of pep- 
tide synthesis, the set of L-amino acids, D-amino acids, or synthetic amino acids. As used herein, "monomer" refers to 
any member of a basis set for synthesis of an oligomer. For example, dimers of L-amino acids form a basis set of 400 
"monomers" for synthesis of polypeptides. Different basis sets of monomers may be used at successive steps in the 
synthesis of a polymer. 

to Oligomer or Polymer : The "oligomer" or "polymer" sequences of the present invention are formed from the chemi- 
cal or enzymatic addition of monomer subunits. Such oligomers include, for example, both linear, cyclic, and branched 
polymers of nucleic acids, polysaccharides, phospholipids, and peptides having either alpha-, beta-, or omega-amino 
acids, heteropolymers, polyurethanes, polyesters, polycarbonates, polyureas. pdyamides. polyethyleneimines, pol- 
yarylene sulfides, polysiloxanes, polyimides, polyacetates, or other polymers, as will be readily apparent to one skilled 

45 in the art upon review of this disclosure. 

Peptide : A "peptide" is an oligomer in which the monomers are alpha amino acids joined together through amide 
bonds. Alternatively, a "peptide" can be referred to as a "polypeptide." In the context of this specification, one should 
appreciate that the amino acids may be the L-optical isomer or the D-optical isomer. Peptides are more than two amino 
acid monomers long, but more often are more than 10 amino acid monomers long and can be even longer than 20 

so amino acids, although peptides longer than 20 amino acids are more likely to be called "polypeptides." Standard single 
letter abbreviations for amino acids are used (e.g.. P for proline). 

Oligonucleotides : An "oligonucleotide" is a single-stranded DNA or RNA molecule, typically prepared by synthetic 
means. Those oligonucleotides employed in the present invention will usually be 50 to 150 nucleotides in length, pref- 
erably from 80 to 120 nucleotides, although oligonucleotides of different length may be appropriate in some circum- 

55 stances. For instance, in one embodiment of the invention, the oligonucleotide tag and the polymer identified by that tag 
are synthesized in parallel. In this embodiment, the oligonucleotide tag can be built nucleotide-by-nucleotide in coordi- 
nation with the monomer-by-monomer addition steps used to synthesize the oligomer. In addition, very short, i.e., 2 to 
10 nucleotides, oligonucleotides may be used to extend an existing oligonucleotide tag to identify a monomer coupling 
step. Suitable oligonucleotides may be prepared by the phosphoramidite method described by Beaucage and Carru- 
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thers Tetr Lett 221859-1862 (1981). or by the triester method, according to Matteucci et at. .1 Am. Chem. Sop. 
UB31"S(1»T) « by other methods such as by using commercial automated oligonucleotide; s*^** 

SU^oLt A nudeic acid is "operably linked" when p.aced into a functional »>-^ 
acid sequence For instance, a promoter or enhancer is "operably linked" to a codmg sequence rf the promo ^causes 
SSSSn of the sequent Generally operably linked means that the DNA sequences bang hnked are contigu- 
n, « and where necessary to join two protein coding regions, contiguous and in reading irame. 

R^oT a"^S is a molecule that has art affinity for a given ligand. Receptors may be £ 
mannSndecuS «so receptors can be employed in their unaltered natural or isolated state or as aggregates w.th 
ISStoTmSS attached, covalentiy or noncovalently. to a binding member, either directly or via a 
ZSSZqSSSSL Samples of receptors mat can be employed in the rr^ 

Sre JoT estricS to antibodies, cell membrane receptors, monoclonal antibodies, antisera reactive^ w.th speatic 
a^ aenicltermSante such as on viruses, cells, or other materials), polynucleotides, nucle.cac.ds. lectins, polysac- 
ce Is TelE merrfcranes and organelles. Receptors are sometimes referred to in the art as "anti-l-gands^ 
SheTerm Z^fSSSJlk no difference in meaning is intended. A ligand-receptor pair" is formed when two 
macromolecules have combined through molecular recognition to form a complex. . . 

Other examples of receptors that can be investigated by this invent, on include, butare not re^Krtedto. 

^w^mreceotore: Determination of ligands that bind to microorgamsm receptors, such as specrficfransport 
DrotSofSei^ial to survival of microorganisms, is useful in discovering new classes or types of anttowti«. 
Carticrr^retSTbe antibiotics against opportunistic fungi, protozoa, and those bactena res,stant to the antib.- 

^"Stance, the binding s*e of any enzyme, such as the enzymes ^^^^^ 
mitt^JsVreceptor. Determination of ligands that bind to certain enzymes .and 

enzymes that cleave the different neurotransmitters, is useful in the development of drugs that can be used .n the treat 
„, llothatromh i nes with the eoitooe of an antigen of interest. Determining a sequence that mimics an antigenic epitope 

^ proximate to the binding site, which functionality is capable of chemically modifying the bound reactant. Catalytic 

DeteS^tiSnds which bind with high affinity to a hormone receptor is useful in the dev^ent of. for 
o«^TJ *r ^ral reDtocemerrt of the daily injections which diabetics must take to relieve the symptoms of diabetes, and 
ZSSSSL T S£££ S hXJ growth hormone, which can only be obtained from ^^^LSZ 
WnantDNA Z^okf Other examples include the vasoconstrictive hormone receptors; determ.nat.on of ..gands that 
KinH *n thr«p rpceotors mav lead to the development of drugs to control Wood pressure. 

^IvnmeiTconvound is "synthetic" when produced by in yjtro chemical or enzymatic synthesis. The synthetic 
HbraSS ^^onZay be contrasted with those in viral or plasm* vectors, for instance, which may be 
i propagated in bacterial, yeast, or other living hosts. 

i MPthnrt for Producing ' ^mo Synthetic Oligomer Libraries 

A oeneral method of random oligomer synthesis can be used to produce the enormous numbers of compounds 
, availlte Sre^nSnant^Sems ar* to utilize the monomer set diversity availaUe with chemical s^'* m«^ 
^ri^S^SuJito 1° 12 different oligomers, a dramatic improvement over pre^us methods. The inven- 

comprising a single oligomer sequence (e.g., a peptide). The sequence may be soluble or may be bound to a solid sup 



EP 0 773 227 A1 



port. When bound to a solid support, the oligomer is usually attached by means of a linker. The linker, prior to attach- 
ment, has an appropriate functional group at each end, one group appropriate for attachment to the support and the 
other group appropriate for attachment to the oligomer. Such a collection may contain, for example, all combinations of 
n monomers assembled into X length oligomers yielding, n x different compounds. The collection may also contain oli- 

5 gomers having different monomer units at, for example, only one or a small number of positions, while having an iden- 
tical sequence at all other positions. The general method typically involves synthesizing the oligomers in a random 
combinatorial ("stochastic") fashion by chemical and/or enzymatic assembly of monomer units. 

A synthetic oligomer library may be produced by synthesizing on each of a plurality of solid supports a single oli- 
gomer sequence, the oligomer sequence being different for different solid supports. The oligomer sequence is synthe- 

io sized in a process comprising the steps of: (a) apportioning the supports in a stochastic manner among a plurality of 
reaction vessels; (b) exposing the supports in each reaction vessel to a first monomer; (c) pooling the supports; (d) 
apportioning the supports in a stochastic manner among the plurality of reaction vessels; (e) exposing the supports in 
each reaction vessel to a second monomer; and (f) repeating steps (a) through (e) from at least one to twenty times. 
Typically, substantially equal numbers of solid supports will be apportioned to each reaction vessel. The monomers may 

15 be chosen from the set of amino acids, and the resulting oligomer is a peptide. 

As a specific example of the method, one may consider the synthesis of peptides three residues in length, assem- 
bled from a monomer set of three different monomers: A, B, and C. The first monomer is coupled to three different aliq- 
uots of beads, each different monomer in a different aliquot, and the beads from ail the reactions are then pooled (see 
Fig. 1). The pool now contains approximately equal numbers of three different types solid supports, with each type char- 
ge acterized by the monomer in the first residue position. The pool is mixed and redistributed to the separate monomer 
reaction tubes or vessels containing A, B. or C as the monomer. The second residue is coupled. 

Following this reaction, each tube now has beads with three different monomers in position one and the monomer 
contained in each particular second reaction tube in position 2. All reactions are pooled again, producing a mixture of 
beads each bearing one of the nine possible dimers. The pool is again distributed among the three reaction vessels, 

25 coupled, and pooled. This process of sequential synthesis and mixing yields beads that have passed through all the 
possible reaction pathways, and the collection of beads displays all trimers of three amino acids (3 3 27). Thus, a com- 
plete set of the trimers of A, B, and C is constructed. As can be readily appreciated, the use of a sufficiently large 
number of synthesis beads helps to ensure that the set completely represents the various combinations of monomers 
employed in this random, combinatorial synthesis scheme. 

30 This method of assembling oligomers from many types of monomers requires using the appropriate coupling 
chemistry for a given set of monomer units or building blocks. Any set of building blocks that can be attached to one 
another in a step-by-step fashion can serve as the monomer set. The attachment may be mediated by chemical, enzy- 
matic, or other means, or by a combination of any of these means. The resulting oligomers can be linear, cyclic, 
branched, or assume various other conformations as will be apparent to those skilled in the art. Techniques for solid 

35 state synthesis of polypeptides are described, for example, in Merrifield. supra . Peptide coupling chemistry is also 
described in The Peptides. Vol. 1 (eds. Gross, E., and J. Meienhofer, Academic Press, Orlando (1979)) 

To synthesize the oligomers, a collection of a large number of the solid supports is apportioned among a number 
of reaction vessels. In each reaction, a different monomer is coupled to the growing oligomer chain. The monomers may 
be of any type that can be appropriately activated for chemical coupling or accepted for enzymatic coupling. Because 

40 the reactions may be contained in separate reaction vessels, even monomers with different coupling chemistries can 
be used to assemble the oligomers (see The Peptides, supra) . The coupling time for some of the monomer sets may 
be long. For this reason, the preferred arrangement is one in which the monomer reactions are carried out in parallel. 
After each coupling step, the solid supports on which are synthesized the oligomers of the library are pooled and mixed 
prior to re-allocation to the individual vessels for the next coupling step. This shuffling process produces solid supports 

45 with many oligomer sequence combinations. If each synthesis step has high coupling efficiency, then substantially all 
the oligomers on a single solid support have the same sequence. That sequence is determined by the synthesis path- 
way (type and sequence of monomer reactions) for any given solid support at the end of the synthesis. The maximum 
length of the oligomers is typically less than about 20. usually from 3 to 8 residues in length, but in some cases a length 
of 10 to 12 residues is preferred. Protective groups known to those skilled in the art may be used to prevent spurious 

so coupling (see The Peptides. Vol. 3 (eds. Gross. E.. and J. Meienhofer, Academic Press, Orlando (1981) 

Modifications of this completely random approach are also possible. For example, the monomer set may be 
expanded or contracted from step to step; or the monomer set could be changed completely for the next step (e.g., 
amino acids in one step, nucleosides in another step, carbohydrates in another step), if the coupling chemistry were 
available (see Gait. Oligonucleotide Synthesis: A Practical Approach. IRL Press. Oxford (1984); Friesen and Danishef- 

55 sky, J. Amer. Chem. Soc . 111:6656 (1 989); and Paulsen, Anaew. Chem. Int. Ed. Enal. 25:21 2 (1 986) 

Monomer Units for peptide synthesis, for example, may include single amino acids or larger peptide units, or both. One 
variation is to form several pools of various sequences on solid supports to be distributed among different monomer 
sets at certain steps of the synthesis. By this approach, one can also build oligomers of different lengths with either 
related or unrelated sequences, and one can fix certain monomer residues at some positions while varying the other 
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' ns are altered to provide diversity. 
l%An some embodiments. tag f^chmert ^and7o ^ ^ supports are ^monly «edto » ^ ^ to tnose 

»i»^«^ B, ?^ w S5Siiii' <*9° mere as enumerate ° 

for example. pepWJes and nucle.cac.as rtmolete sets of certain oligomers, if desred. 

skilled in the art. rt ^.ng, 0 ne can generate <^«"?L solid support of up to 1 mm .n 

With enough sold supports a nd e Jrt* P u nm to 100 ^m, but a more ^^.^ynthesis sites and 

peptidesyn m esizingres.ns..sabout0.l pgo p p» resin8 are preferable 

about 10 8 oligomer chains. ^ rous than typical peptide f ofde rs of 

r^^ing'tf^ dorrtnant ^"^^V^j^'apy shape although they wiO ^ e ^ et ^^^ ] ^^^^l^e^i^e'^^*^ : ^^^ 
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described herein primarily with regard to the preparation of molecules containing sequences of amino acids, but the 
invention can readily be applied to the preparation of other oligomers and to any set of compounds that can be synthe- 
sized in a componerrt-by-component fashion, as can be appreciated by those skilled in the art. 

In another embodiment, the same solid support is used for synthesizing all members of the library, but the members 

5 are cleaved from the support prior to screening. In this embodiment, synthesis of tagged oligomers may be accom- 
plished utilizing very large scale immobilized polymer synthesis (VLSI PS™) techniques. See U.S. Patent No. 5,143,854 
and PCT patent publication No. 92/10092, each of which is incorporated herein by reference. An array of oligonucle- 
otides is synthesized on the VLSI PS™ chip, each oligonucleotide linked to the chip by a cleavable group such as a 
disulfide. In one embodiment, each oligonucleotide tag has an amine group at the free end and only contains pyrimidine 

10 or pyrimidine and purine analog bases. In addition each oligonucleotide contains binding sites for amplification, i.e., 
PCR primer sites and optionally a sequencing primer site. A short section of each oligonucleotide uniquely codes the 
monomer sequence of the oligomer to be tagged. Then, e.g. , peptides are synthesized, optionally from the free terminal 
amine groups on each oligonucleotide, so that each peptide is linked to a tag. The whole collection of oligonucleotide- 
peptide may be released from the chip to create a soluble tagged oligomer library. 

is More preferably, however, the oligomer library is constructed on beads or particles. One method of bead f unction - 
alization, with compatible chemistries for peptide synthesis and round by round attachment of oligonucleotide identifier 
tags, is shown in Figs. 3.1-3.6. Glass beads are derivatized using aminopropyttriethoxysilane and a beta-alanine spacer 
group is coupled using activated ester methodology. The oligonucleotide tags may optionally incorporate a biotin group 
to facilitate purification, hybridization, amplification, or detection (see Pierce ImmunoTechnoloov Catalog and Hand- 

20 book. 1991 Commercially available Fmoc protected amino acids and standard BOP coupling chemistry is employed for 
peptide synthesis (see The Peptides, supra) . Protected polypyrimidine (e.g., cytidine protected as N 4 -Bz-C) and/or 
purine analog containing oligonucleotides resistant to the coupling and deprotection reagents used in peptide synthesis 
are attached using maleimide chemistry to unmasked thiol groups incorporated into growing peptide chains at low fre- 
quency (i.e., 0.1%) as cysteine residues with masked thiol groups (which masks may be selectively removed prior to 

25 tagging). In other embodiments of the invention, one may not need to use protected nucleosides or oligonucleotides. 

However, to maintain the integrity of an oligonucleotide tag during peptide synthesis, one may need to use different 
combinations of protecting groups and/or synthetic nucleotides to avoid degradation of the tag or the oligomer synthe- 
sized. In general, polypyrimidine oligonucleotide tags are relatively stable under typical peptide synthesis conditions, as 
opposed to oligonucleotide tags that contain natural purine nucleotides, but a polypyrimidine nucleotide tag may be 

30 somewhat refractory to amplification by PCR. One may need to incorporate purine bases, or analogs tested for ability 
to withstand peptide coupling (and deprotection) conditions, into the tag to acheive a desired efficiency of amplification. 
For purposes of the present invention, the tag optionally may contain from 10 to 90%, more preferably 35 to 50%, and 
most preferably 33 to 35%. purine or purine analog nucleotides. The oligonucleotides optionally may contain phosphate 
protecting groups (e.g., O-methyl phosphates) with greater base stability than the standard beta-cyanoethyi group, 

35 which may be susceptible to piperidine cleavage. In such cases, peptide and oligonucleotide deprotection can be 
effected by sequential treatment with thiophenol, trrf luoroacetic acid, and ethanolic ethylenediamine at 55 °C In another 
embodiment, photolabile alpha-ami no protecting groups are used in conjunction with base-labile side chain protecting 
groups for the amino acids, and standard beta-cyanoethyl protecting groups are used for the oligonucleotide tags. 
In another embodiment, oligonucleotides containing both modified or synthetic purines and pyrimidines may be 

40 synthesized in parallel with peptides using conventional Fmoc/Bu protected amino acids. In this method, one can also 
use O-allyl and N-allyioxycarbonyl groups to provide protection for phosphate oxygens and the exocyclic amines of the 
nucleoside bases, respectively (see Hayakawa et ah, J. Amer. Chem . Soc . 112 : 1691-1696 (1990) employing the mild 
oxidant l BuOOH for oxidation at the phosphorous, one can minimize oxidation of the amino acids methionine, tryp- 
tophan, and histidine (see Hayakawa ej al., Tetr. Left . 27:4191-4194 (1986) Use of pyridinium hydrochloride/imidazole 

45 as a phosphporamidite activator leads to selective S'-O-phosphitylation at the expense of low levels of spurious reaction 
at nitrogen on the peptide or oligonucleotide (see Gryaznov and Letsinger, Nucleic Acids Research 20: 1879-1882 
(1992) The lability of purine nucleotides to strong acid (e.g., TFA) is avoided by use of phosphoramidites of the purine 
nucleoside analogs 7-deaza-2-deoxyadenosine and 7<Jeaza-2'<Jeoxyguanosine (see Barr e£ al., BioTechnigues 
4:428-432 (1986), and Scheit. Nucleotide Analogs: Synthesis and Biological Function pp. 64-65 (John Wiley and Sons, 

so New York) 

The fully assembled peptide and oligonucleotide chains may be deprotected by first treating the products with 30% 
piperidine in DMF to remove amino-terminal Fmoc groups. Then, the allylic protecting groups are removed using THF 
containing tris (dibenzylideneacetone) dipalladium-chloroform complex, triphenylphosphine, and n-butylamine/formic 
acid, followed by a THF wash, an aqueous sodium N.N-diethyldithiocarbamate wash, and a water wash. Finally, the 
55 acid-labile amino acid protecting groups are removed by treatment with 95:5 TFA/water. 

Other methods also provide effective orthogonal protection during the parallel assembly of oligonucleotides and 
peptides. These methods include use of acid-labile protecting groups on phosphates and exocyclic amines of deoxycy- 
tidine, 7-deaza-deoxyadenosine. and 7-deaza-deoxyguanosine sufficiently robust to resist the 3% trichloroacetic acid 
used in S'-O-detritylation; use of photochemically removable protecting groups on these residues; and combinations of 
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such acid and photolabile groups (for photolabile protecting groups for phosphate, see Baldwin et at. Tetr. Lett. 46: 
6879-6884 (1990) see also figure 5). 

in identifying th* Renuence of anv Oligomer 

The present invention provides a method for identifying the composition and sequence of any of the oligomers .in 
the library By tracking the synthesis pathway that each oligomer has taken, one can deduce the sequence of mono- 
mersofany o'igomAe method involves lining an identifier tag to the oligomer that ir^icates the monom^ 
and corresponding step numbers that define each oligomer in the library. After a series of synthesis steps (and concur- 
Tent SeSr tag addSons). one "reads- the identifier tag(s) associated with an oligomer to determine the sequence of 

th3t FoTSanple. one might attach microscopically recognizable, alphanumeric tags to each bead (see Fig. 2): -AT 
means that the bead participated in the A-monomer reaction at step 1. "C2" means ™*» 

monomer reaction at step 2. and "B3" means B-monomer was added in step 3 and so on. At the end £ *«JP*" 
thesis, the bead would have three tags attached, e.g., A1 , C2. and B3. indicating that the sequenc^of *e peptrfeson 
the bead is ACB This scheme requires a number of distinct identifier tags equal to at most the product of the number 
of different monomers and the number of synthesis steps (nine in this example). The number of identifier tags is 
fed^TthTsyn.bo.s are attached to one another in the order of the steps: A. A-C. A-C-B. In thfe case only as many 
identifier tags are needed as monomers. One builds the identifier tag in much the same way as the peptides, so as to 

preserve a recoid of what was monomer was added, and in which addition step. 

The identifier tags therefore identify each monomer reaction that an individual library rmember or solid ^"^as 
experienced and record the step in the synthesis series in which each monomer is added, rmy be attadied 

Mediately before, during, or atter the monomer addition reaction, as^^ 

tifiertag. modes of attachment, and chemistry of oligomer synthesis. The idenW.er tag is added when the solid supports 
that have undergone a specific monomer addition step are physically together and so can be tagged as a group, i.e.. 

prior to the next pooling step. .. . . „„„ „ aoH 

In some cases, of course, when only a small number of monomer units of an oligomer are varied, one may need to 
identify only those monomers which vary among the oligomers, as when one wants to vary only a few am.no ac,ds ma 
peptide. For instance, one might want to change only 3 to 6 amino acids in peptides 6 to 12 amino acids longer one 
Sight want to change as few as 5 amino acids in polypeptides up to 50 amino acids long. One may uniquely i ****** 
sequence of each peptide by providing for each solid support an identifier tag specifying only the amino acids varied in 
eTcn^uenct as^l. be readily appreciated by those skilled in the art.^such^se* all 

in me same reaction vessel tor the addition of common monomer units and apportioned among different reaction ves- 
sels for the addition of distinguishing monomer units. .. ^.^h _ 

The identifier tag can be associated with the oligomer through a variety of mechan.sms, ertherdirectly. througha 
linking molecule, or through a solid support upon which the oligomer is synthesized. In thel latte 
attach the tag to another solid support that, in turn, is bound to the solid support upon which the oligomer is synthe- 
sized. 

l\i Typ^ rrf Irlantifier Taos 

The identifier tag may be any recognizable feature that is. for example: microscopically distinguishable in shape, 
size, color, optical density, etc.; differently absorbing or emitting of light: chemically reactive; magnetically on electroni- 
cally encoded; or in some other way distinctively marked with the required information, and decipherable at the levd of 
one (or few) solid supports. In one embodiment, each bead or other solid support in the library incorpo ratesa variety of 
f luorophores. or other light addressable type of molecules, the spectral properties of which can be changed and there- 
fore used to store inforrnation. In one such mode, a bead incorporates a variety of fluorophors, each of which canbe 
selectively photoHeached. and so rendered incapable of fluorescence or of diminished fluoresce. During eachcou- 
pling step, the bead is irradiated (or not) to photobleach (or not) one or more particular types of fluorophors. thus record- 
ino the monomer identity in the oligomer synthesized. See Science 255: 1213 (6 Mar. 1992) 

On™cc^truct rn.croscopica.ly identifiable tags as small beads of recognizably different sizes shapes or 
colors, or labeled with bar codes. The tags can be "machine readable" luminescent or radioactive labels. The identifier 
tag can also be an encodabie molecular structure. The information may be encoded in the size (e.g. length of a poly- 
mer) or the composition of the molecule. The best example of this latter type of tag is a nucleic acid sequence, i.e.. RNA 
or DNA assembled from natural or modified bases. ™„™.,Ho«tiA»« ! 

Synthetic oligodeoxyribonucleotides are especially preferred information-bearing idenW.er te 9* 0 ^™ uc,e ^ 
are a natural, high density information storage medium. The identity of monomer type and the step of addrton is easily 
encoded in a short oligonucleotide sequence and attached, for example, to each peptide synthesis bead- When >a jingle 
bead is isolated by screening, e.g.. for receptor binding, the attached oligonucleotides can be ampM.ed by methods 
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such as PCR (see PCR Protocols: A Guide to Method s and Applications . (Innis. M, GeHand, D.. Sninsky. J. and White, 
T, Academic Press, San Diego 1990) or by other nucleic acid amplification techniques, such as the ligase chain reac- 
tion and the self-sustained sequence replication system. The amplified product can be easily sequenced or otherwise 
identified to decode the identity of the peptide on the bead. For this purpose, one can use any of a variety of sequencing 

5 methods, including sequencing by sequence-specific probe hybridization. 

Alternatively, the information may be encoded in the length rather than, or in addition to, the sequence of the oligo- 
nucleotide. If only oligonucleotide length is utilized to represent each specific monomer addition to the oligomer, then 
the identity of the oligomer can be decoded by amplifying the oligonucleotide, as described above, and identifying the 
labels through any of a variety of size-separation techniques, including potyacrylamide gel electrophoresis or capillary 

io electrophoresis. 

There are several ways that oligonucleotides can be used as identifier tags. The oligonucleotides can be assem- 
bled base-by-base before, during, or after the corresponding oligomer (e.g., peptide) synthesis step. In one case of 
base-by-base synthesis, the tag for each step is a single nucleotide, or at most a very few nucleotides (i.e., 2 to 5). This 
strategy preserves the order of the steps in the linear arrangement of the oligonucleotide chain grown in parallel with 
is the oligomer. To preserve the chemical compatibility of the parallel synthetic steps (oligonucleotides and peptides, for 
example), one can modify the standard synthesis chemistries. 

One variation of base-by-base assembly is the block-by-block approach; encoded sets of nucleotides ("codons") of 
5 to 10 or more bases are added as protected, activated blocks. Each block carries the monomer-type information, and 
the order of addition represents the order of the monomer addition reaction. Alternatively, the block may encode the oli- 
20 gomer synthesis step number as well as the monomer-type information. 

One could also attach protected (or unprotected) oligonucleotides containing amplification primer sites, monomer- 
specific information, and order-of-reaction information, from 10 to 50 to 150 bases in length, at each step. At the end of 
a series of n oligomer synthesis steps, there would be n differently encoded sets of oligonucleotide identifier tags asso- 
ciated with each oligomer sequence. After identifying the oligomers with ligand activity, the associated oligonucleotides 
25 are amplified by PCR and sequenced to decode the identity of the oligomer. 

V. Linking the Identifier Taq(s) to the Oligomer 

The identifier tags may be attached to chemically reactive groups (unmasked thiols or amines, for example) on the 
30 surface of a synthesis support functionalized to allow synthesis of an oligomer and attachment or synthesis of the oli- 
gonucleotide identifier tag. The tags could also be attached to monomers that are incorporated into a small proportion 
of the oligomer chains; or as caps on a small number of the oligomer chains; or to reactive sites on linkers joining the 
oligomer chains to the solid support. 

In one embodiment, the solid supports will have chemically reactive groups that are protected using two different 
35 or "orthogonal" types of protecting groups. The solid supports will then be exposed to a first deprotection agent or acti- 
vator, removing the first type of protecting group from, for example, the chemically reactive groups that serve as oli- 
gomer synthesis sites. After reaction with the first monomer, the solid supports will then be exposed to a second 
activator which removes the second type of protecting group, exposing, for example, the chemically reactive groups that 
serve as identifier tag attachment sites. One or both of the activators may be in a solution that is contacted with the sup- 
40 ports. 

In another embodiment, the linker joining the oligomer and the solid support may have chemically reactive groups 
protected by the second type of protecting group. After reaction with the first monomer, the solid support bearing the 
linker and the "growing" oligomer will be exposed to a second activator which removes the second type of protecting 
group exposing the site that attaches the identifier tag olrectly to the linker, rather than attachment directly to the solid 
45 support. 

When activators or deprotection agents are incorporated into the method of preparing a synthetic peptide library 
having a plurality of different members, each member comprising a solid support attached to a different single peptide 
sequence and an oligonucleotide identifier tag identifying said peptide sequence, the method comprises: a) apportion- 
ing the solid supports among a plurality of reaction vessels; b) reacting the solid supports with a solution in each reac- 

50 tion vessel and treating sequentially with (1) a first activator to remove a first type of protective group from the solid 
support, (2) a first amino acid or peptide to couple said amino acid or peptide to said solid support at sites where said 
first type of protective group has been removed; (3) a second activator to remove a second type of protective group from 
the solid support; and (4) a first nucleotide or oligonucleotide tag to couple said tag at sites where said second type of 
protective group has been removed; c) pooling the solid supports; d) apportioning the pooled solid supports among a 

55 plurality of reaction vessels; and e) repeating step (b) to couple a second amino acid or peptide and a second nucle- 
otide or oligonucleotide tag to said solid support. 

As noted above, the invention can also be carried out in a mode in which there is no solid support, and the tag is 
attached directly (or through a linker) to the oligomer being synthesized. The size and composition of the library will be 
determined by the number of coupling steps and the monomers used during the synthesis. Those of skill in the art rec- 
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ognize that either the tag or the monomer may be coupled first, in either embodiment. 

Another possible embodiment is the use of two solid supports, such as beads, that are physically linked together, 
one with synthesis sites (or linkers) for the oligomer and one with attachment sites (or linkers) for the identifier tags. This 
arrangement allows the segregation of oligomers and identifier tags into discrete "zones" and permits the use of widely 
different chemically reactive groups and chemistries for attachment. The solid supports can be derivatized separately 
and then linked under conditions where all or nearly all of the synthesis solid supports will have a tag-attachment solid 
support in tow. The solid supports can be of different sizes, as for example a large synthesis bead with several (or many) 
smaller tag-attachment beads linked. In one embodiment, the first solid support will have at least one attached amino 
acid and the second solid support will have at least one attached nucleotide. 

The mode of linking the two beads is constrained by the chemistry of oligomer synthesis. The most obvious means 
of linking the beads is with a heterobifunctional cross-lining agent (for examples of such agents, see Pierce ImmunoTe- 
chnoloav Catalog and Handbook pp. E1 0-E1 8 (1 991 )) interacting with the dominant chemically reactive groups on each 
species of solid support. 

VI. Encoding the Identifier Tag Information 

The choice of bases used in an oligonucleotide identifier tag is dictated by the chemistry of oligomer synthesis. For 
example, the use of strong acid to deprotect peptides would depurinate nucleic acids. Therefore, when standard chem- 
istries for peptide synthesis are employed, the pyrimidines C and T could be used in a binary code. Thus, in a preferred 
embodiment, the identifier tag will be an oligopyrimidine sequence. 

In another embodiment, the lability of purine nucleotides to strong acid may be overcome through the use of the 
purine nucleoside analogs, such as 7-deaza-2'-deoxyadenosine and 7-deaza-2-deoxyguanosine (see Barr el at, SiQz 
Techniques 4:428-432 (1986). and Scheit, Nucleotide A nalogs: Synthesis and Biological Function pp. 64-65 (John 
Wiley and Sons, New York) 

Use of these or other analogs would permit the use of a quaternary or other, as opposed to a binary, encoding 
scheme. 

Information retrieval from oligonucleotide identifier tags is possible through various encryption schemes, two of 
which are described below. In the first, the oligomer sequence information is at least in part encoded in the length of the 
oligonucleotide. Each different monomer added at a given step in the oligomer synthesis may be represented by an oli- 
gonucleotide tag of unique length. The oligonucleotide inherently contains amplification sites, such as PCR priming 
sequences, characteristic of the given step-number in the oligomer synthesis. Determination of the oligomer composi- 
tion at any given position in the sequence then involves amplifying the tag using the PCR priming sequence character- 
istic for that step in the synthesis and size-separating the amplification products utilizing techniques well known in the 
art. such as gel or capillary electrophoresis (using the tagging oligonucleotides as standards) This embodiment is par- 
ticularly useful when one desires to make a library of compounds related to a lead sequence. One need only tag during 
steps in which a site being analoged is synthesized. 

In addition to length, oligomer sequence information can also be encoded in the sequence of bases comprising the 
oligonucleotide tag. This type of encryption is of value not only in the embodiment in which one attaches a different oli- 
gonucleotide tag at each coupling step but also in the embodiment in which one extends an oligonucleotide tag at each 
coupling step. For example, as shown in Fig. 4, one may use oligonucleotides of up to about 100 bases (or somewhat 
longer), each having seven regions, as described below. 

Region 1 is a 3'-PCR primer site (20 to 25 bases). This site is used in conjunction with another PCR site (at the 5 - 
end of the oligonucleotide) to prime amplification by PCR. Other amplification methods may also be used. 

Region 2 is a "step-specrfic" DNA sequencing primer site (15-20 bases). This site is specific for the particular num- 
bered step in the synthesis series. All the oligonucleotides added to all the beads at a particular step will have this 
sequence in common. Each numbered step will have a highly specific primer site representing that step. 

Region 3 is a spacer (20-30 bases). A spacer segment of variable length, but preferably 20 to 30 bases long, places 
the coding site sufficiently distant from the sequencing primer site to give a good "read" through the monomer encoding 
or identification region. 

Region 4 is a monomer identification region (8 bases). Each base in this string represents one bit of binary code, 
where, for example, T = 0 and C = 1 . Each set of step-specific identifier tags consists of 8 bases with a 1 (C) or a O (T) 
at each of the 8 positions. These may be thought of as switches set to "on" or "or at the different positions. Each mon- 
omer type is encoded by a mixture of 1 to 8 of these "switches." 

Region 5 is a step number confirmation region (4 bases plus 2 bases on either side for region distinction). Four bits 
in this short stretch encode the step number. This is redundant to the sequencing primer but can be used to confirm 
that the proper primers were used and that the right step is decoded. 

Region 6 is a repeat of the monomer identification region (8 bases). This region has the same information as region 
4, and is used to confirm monomer identity. Installing this second monomer encoding region also increases the proba- 
bility that a good sequencing "read" will be obtained. 
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Region 7 is a 5'- PCR primer site (20 to 25 bases). This site serves as a site for annealing the second PCR primer 
for amplification of the sequence. The length of oligonucleotides with all seven of these features, some of which are 
optional, will commonly be between 75 and 125 bases. 

An 8 bit format can encode 256 different monomer types. The number of steps that can be encoded is determined 
5 by the number of step-specific sets (8 per set) of oligonucleotides on hand. With 1 0 sets (80 oligos) one can encode up 
to 256 different monomers assembled into oligomers up to 10 units long (thus providing encoding capability for up to 
256 10 = 1.2 x 10 24 oligomer sequences). The coded identifier tags may be used so that each monomer is assigned a 
specific binary number (e.g. Ala = 00000001 , Gly = 000001 10, etc.). The appropriate oligonucleotides are combined to 
give the correct binary code. 

10 

VII. Recovering and Decoding the Identifier Tag Information 

When specific beads are isolated in a receptor screening experiment, the beads can be segregated individually by 
a number of means, including: infinite dilution, micromanipulation, or preferably, fluorescence activated cell sorting 
is (FACS), although, with respect to the present invention. FACS is more accurately "fluorescence activated oligomer or 
solid support sorting" (see Methods in Cell Biology. Vol. 33 (DarzynWewicz, Z. and Crissman. H.A., eds., Academic 
Press); and Dangl and Herzenbero. J, Immunol. Methods 52:1-14 (1982) 

Once the desired beads have been isolated, one needs to identify the tag to ascertain the sequence of the oligomer 
on the bead. 

20 To facilitate tag identification, one has a variety of options. For instance, one could read the tag directly from the 
bead by sequencing or hybridization, if the tag is an oligonucleotide. One can also amplify oligonucleotide tags to facil- 
itate tag identification. The oligonucleotide identifier tags carried by a single solid support or oligomer can be amplified 
in viva by cloning, or in vitro, e.g., by PCR. If the limit of detection is on the order of 100 molecules, then at least 100 or 
more copies of each oligonucleotide tag on a bead would be required. Copies of the tag are produced, either as single 

25 stranded oligonucleotides, double-stranded nucleic acids, or mixtures of single and double-stranded nucleic acids, by 
any of a variety of methods, several of which are described below, and the amplified material is sequenced. In the 
embodiment of the invention in which a separate and distinct oligonucleotide tag is added at each monomer addition 
step (as opposed to extending an existing tag at each step), one can amplify all tags at once and then divide the ampli- 
fied material into as many separate sequencing reactions as there were oligomer synthesis steps (employing a different 

30 sequencing primer for each type of tag). In this embodiment, one could also design the tags so that each tag could be 
amplified separately from the other tags by appropriate choice of primer sequences. The sequencing reactions are per- 
formed and run on a standard sequencing gel, and the oligomer sequence is deduced from the code revealed in the 
resulting sequence information. 

An alternative strategy is to use common PCR primers and common sequencing primers (the sequencing primer 

35 may even overlap completely or partially with a PCR primer site) and identify the step by hybridization to oligonucleotide 
probes that are complementary to each step-specific sequence in the oligonucleotides from the bead. A single set of 
sequencing reactions is performed on all of the amplified oligonucleotides from a single bead, and the reaction products 
are run in a single set of lanes on a gel. The reaction products are then transferred to a suitable hybridization membrane 
and hybridized to a single step-specific probe (see Maniatis et al., Cold Spring Harbor Laboratory, Cold Spring Harbor, 

40 NY (1982) 

After detection of the resulting signal, the probe is washed from the membrane and another step-specific probe is 
hybridized. One could also use the procedure described in EPO publication No. 237,362 and PCT publication No. 
89/11548 

Parallel hybridization provides an alternative to sequential hybridization. The sequencing reactions are divided into 
45 a number of aliquots equal to the number of peptide synthesis steps and run in a separate set of lanes for each on the 
sequencing gel. After transfer of the reaction products to a suitable membrane, the membrane is cut to separate the 
sets of lanes. Each lane set is then hybridized to one of a plurality of step-specific oligonucleotide probes (see "Uniplex 
DNA sequencing" and "Multiplex DNA sequencing," in Plex Luminescent Kits Product Catalog. Bedford, MA, 1990 
As noted above, a single synthesis solid support (or an attached bead bearing a tag, or in solution in a "well") may 
so only comprise a few hundred copies of each oligonucleotide tag. These tags may be amplified. e.g., by PCR or other 
means well known to those skilled in the art, to provide sufficient DNA to be sequenced accurately. The ability to decode 
the oligomers depends on the number of available oligonucleotide identifier tags, the level of amplification that can be 
achieved from the available tags, and the accuracy of sequencing that amplified DNA. 

The most commonly used in vitro DNA amplification method is PCR. Alternate amplification methods include, for 
55 example, nucleic add sequence-based amplification (Compton, Nature 350:91 -92 (1 991 ), and amplified antisense RNA 
(Van Gelder ej Proc. Nat. Acad. Sci. USA g§: 7652-7656 (1988), which is incorporated herein by reference), and the 
serf-sustained sequence replication system (3SR, see Guatelli et al., Proc. Natl. Acad. Set. USA §7: 1874-1878 (1990) 
If PCR amplification of an oligonucleotide identifier tag is employed, one may encounter "PCR product contamina- 
tion," caused by the product of one PCR reaction contaminating a subsequent PCR reaction mixture designed to 
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Another means of proving ^"^^sueptavidin (BssSLlm^^ 
sequencing reaction for each *«P^" £ a single reaction and run .n '^^J^ aam for this pur- 

polymerases). Est DNA JJ*™^ " 1991 , Cleveland OH A nybri di Z ation technique To 

tws end. very large sc^e.mmoW. Z ed polym 
.... r%o /•» n«y*7 and 92/1 0588 



this end, very .a.y- — 
tion Nos. 92/10587 and 92/10588 

40 vm 



45 



50 



•ion rMU5>. ^« • v — 

gomer able to bind the receptor. The bourn analogous to FACS meth- 

immunoglobulin. . , .^ iuHua i beads displaying ligands on their sima metnods for selecting and 

^^^^^^^^^^^^ 

Alternatively, affinity adsorpton techn^u g nas b^n .mmob.l.^(s ^ ^ beads 

fixture of beads can . ^ »* - J- After washing to remove unbound beads, on 
No. 91/07087. incorporated herein oy 
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bound to the surface using conditions that reduce the avidity of the oligomer/receptor interaction (low pH. for example). 
The process of affinity adsorption can be repeated with the eluted beads, if desirable. Finally, individual beads are phys- 
ically separated, for example, by limited dilution, by FACS, or by methods similar to those in which cells are incubated 
with a receptor coupled to small superparamagnetic beads and then cells expressing a ligand for the receptor are 
5 extracted using a high power magnet (see Miltenyi el aj. , Cvtometerv 11:231 -238 (1 990), 

Magnetically selected cells can be further analyzed and sorted using FACS. Radionucleotides may also serve to label 
a receptor. 

Alternatively, the present invention can be used to generate libraries of soluble tagged oligomers, which can be 
used with a variety of screening methods. For instance, the oligomer library can be synthesized on beads with an iden- 

10 tifying tag encoding the oligomer sequence. The microscopic beads are placed in individual compartments or wells that 
have been "nanofabricated" in a silicon or other suitable surface. The oligomers are cleaved from the beads and remain 
contained within the compartment along with the bead and the attached identifier tag(s). In one embodiment, the bot- 
tom surface is coated with the receptor, and after the addition of binding buffer and a known ligand for that receptor that 
isfluorescently labelled, one effectively has a solution phase competition assay for novel ligands for the receptor. The 

is binding of the fluorescently labelled ligand to the receptor is estimated by confocal imaging of the monolayer of immo- 
bilized receptor. Wells with decreased fluorescence on the receptor surface indicate that the released oligomer com- 
petes with the labelled ligand. The beads or the tag in wells showing competition are recovered, and the oligonucleotide 
tag is amplified and sequenced to reveal the sequence of the oligomer. 

The beads are loaded in the wells by dispersing them in a volume of loading buffer sufficient to produce an average 

20 of one bead per well. In one embodiment, the solution of beads is placed in a reservoir above the wells, and the beads 
are allowed to settle into the wells. Cleavage of the oligomers from the beads may be accomplished using chemical or 
thermal systems, but a photocleavable system is preferred. 

Recovery of identifier-tagged beads from positive wells may be effectuated by a micromanipulator plucking out indi- 
vidual beads. However, a preferred mode involves the use of beads that have been previously labelled with a f luores- 

25 cent tag. A laser of the appropriate wavelength is then used to bleach the resident beads in only the positive wells. All 
the beads are then removed en masse and sorted by FACS to identify the bleached positives. The associated tags may 
then be amplified and decoded. 

In a variation of this assay, the oligomer and tag may be synthesized attached to a common linker, which, in turn, 
is bound to the solid support. After placing the beads in the wells, one can cleave the linker from the bead, producing a 

30 tagged oligomer in solution. An immobilized receptor, such as a receptor bound to a bead or a receptor immobilized on 
one surface of the well, can be screened in a competition assay with the oligomer and a fluorescently labeled ligand. 
Instead of recovering the beads, one may recover the beads bearing immobilized receptors and sort the beads using 
FACS to identify positives (diminished fluorescence caused by the library oligomer competing with the labeled ligand) 
or one can determine the fluorescence emitting from the well surface coated with receptor. The associated identifier tag 

35 may then be amplified and decoded: 

In a third variation of this approach, soluble tagged oligomers, produced either by cleavage of the linked oligomer 
and tag from the solid support as described above, or synthesized by the VLSIPS™ method described above, or syn- 
thesized in solution without a solid support, are incubated with an immobilized receptor. After a wash step, the bound, 
tagged oligomers are released from the receptor by, e.g., acid treatment The tags of the bound oligomers are amplified 

40 and decoded. 

IX. An Automated Instrument for Oligomer Synthesis and Tagging 

The coupling steps for some of the monomer sets (amino acids, for example) require a lengthy incubation time, and 
45 a system for performing many monomer additions in parallel is desirable. This can be accomplished with an automated 
instrument able to perform 50 to 100 parallel reactions (channels). Such an instrument is capable of distributing the 
reaction mixture or slurry of synthesis solid supports, under programmable control, to the various channels for pooling, 
mixing, and redistribution. 

Much of the plumbing typical of peptide synthesizers is required, with a large number of reservoirs for the diversity 
so of monomers and the number of tags (up to 80 for a 1 0 step synthesis, in one embodiment) employed. The tag dispens- 
ing capability will translate simple instructions into the proper mixture of tags and dispense that mixture. Monomer build- 
ing blocks will also be dispensed, as desired, as specified mixtures. Reaction agitation, temperature and time control 
may be provided. An appropriately designed instrument may also serve as a multi-channel peptide synthesizer capable 
of producing 1 to 50 mgs (crude) of up to 100 specific peptides for assay purposes. See PCT patent publication 
55 91/17823 
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EXAMPLE SYNTHESIS ON GLASS BEADS OF 4 FLUORESCENTLY TAGGED PENTAPEPTIDES 

A nfrrivatizatio" gj Glass. Beads 

^O.Sgofa-IO^ameter^^ 
m in. The beads were pelleted and washed with ^^l^^^X^ for 10 hours, pelleted and then 
Beads were vortexed with a 5% solution of ^'"^^Je^x *iea Tl25<>C for 45 min. Beads were sus- 
washed with acetone (2x). ethanol (5x), and ^ne chlorri^ ^ooWs) and a so.ution of Fmoc-b-alanine. pen- 
pended in dry DMF (1 m I) contammg d-'sopropyl^anune OJ^J^Si ml) was added. After vortex treatment 
t^luoropheny. ester (200 mg. 420 ^ 

for 1 1 hours, the beads were pelleted and washed with DMF (3x) «'2^^ ne to cap any underivatized 
a 10% solution of acetic anhydride in DMF contammg 0*5 md "^^^^^^^ with a 20% 
Lnopropy. groups, and then washed monHoring the absorbance 

solution of piperidine in DMF and the re ease «^ F ™?™^^e<« the degree of substitution of 10 ^moles 

(2x) and then dried at 85 °C for 12 hours, 
n prrrri'1 ,i """ <R,,r - rilv - L - pne -'-- Leu ' 0H 

, K m „. Ra r heil rt was dissolved in a solution containing distilled water 

Glycyl-L-Phenyla.anyl-L-leuc.ne (552 mg. 1 .5 m ™J^f*f^**^ ™ s baaM with a solution of di-tert-butyt 

(10 ml) and 1 M NaOH (1 .5 m ). The "^f^ ^ but redissotved after stirring 

pyrocarbonate (337 mg, 1 .5 mmol) .n p-d.oxane (12 miy A ^P^~ e s the residue taken up in water (5 

at room temperature for 4 hou^ The so.ut.ori .was c^ncenjat ^ *yne^ extracte d with EtOAc (2x. 

^ P ^raiinnnffi»v-L>Phe-L-LeuBea0s 

Boc-Gly-L-Phe-L-Leu-OH (44 
phosphate (44 mg. 0.1 mmol) and 1 ^^f^^^t^ ToTs m I of this solution was immediately trans- 
mi). Diisopropylethylamine (20 A 0.1 15 mmol) was ^T^^^M. The sealed tube was vortexed for 
ferred to a microcentrifuge tube conta.mng 80 mg of am.no * The beads were 

3.5 hours, and the beads were then peHeted and washed £rth ™F^and^j« y. ^ 
then deprotected with a 50% solution of trrtluoroacebc acrf .n hour 
chtond7(2x). ethanol (2x), and methylene chlonde (2x). and dned at 55 C for 1 hour. 

D Preeara&a G'v-^ Y- 1 -w^L-Lau Beads (SEQ ID NO:10) 

Fmoc-g-ycine perrtaHuoropheny. ester (46 mg. 0.1 mmo.) washed in dry ~^>S^*^ 
ethylamine^ ... 0.1 mmo.). About 0.65 rn of this rtM. washed with DMF (4x) and 
centrifuge tube, and the tube was vortexeojor 3 hou^ The beaos we P pi peridine in DMF for 30 min. 

E P r^ ra tinn nt L-P ^'y-' -PhA-l -LPu Bear* (SFQ ID NQ:1D 

Prnoc-L-proUne pentaf,uoropheny, ester (50 mg, 01 1 mmo.) «-^^^^J^t?SK 
athylamine ({V * 0.1 mmol). About 0.65 ml ofthis Z£g21 f£ washed with DMF (4x) and 
centrifuge tube, and the tube was vortexed lor 3 hou^The beads we P fi jn DMF for 30 min . 

p Fh»nrpgr.Pin Staining o * ni Y -fily-L-Phe-l -I eu Beads 

, , . . -.—ended in 450 ul of aqueous borate buffer (pH 8.5) and 54 

About 5.4 mg of Gly-Gly-L-Phe-L-Leu beB ^^^^^^^ tr6 *tment for 1 .5 hours, the beads were 
pi of a 10 uM solution of fluorescein ^-^^ffi^ FACS analysis indicated that approximately 10% of 
washed with buffer (5x), ethanol (2x), and methylene chionae ^x;. 



30 



35 



40 



45 



50 



16 



EP 0 773 227 A1 



available amino groups had been titrated with FITC. 

G. Co-coupling of L-Tyrosine and Biotin to Mixture of L-Pro-Glv-L-Phe-L-Leu and FITC labelled Glv-Gly-L-Phe-L-Leu 
Beads 

5 mg of FITC labelled Gly-Gly-L-Phe-L-Leu beads and 5 mg L-Pro-Gly-L-Phe-L-Leu beads were mixed together in 
a single tube, vortexed with a 0.1 mM solution of diisopropylethylamine in methylene chloride, and the suspension was 
divided into two equal portions. The beads were pelleted, and to one portion was added a solution containing Fmoc-O- 
tert-butyl-L-tyrosine pentafluorophenyl ester (59 mg, 95 fimol), N-hydroxysuccinimidobiotin (1.7 mg, 5 |irnol) and diiso- 
propylethylamine (17 100 junol) in dry DMF (1 ml). After vortexing for 3 hours the beads were washed with distilled 
water (2x), ethanol (2x), methylene chloride (2x) and DMF (1x). Fmoc deprotection was effected by treatment with a 
20% solution of piperidine in DMF for 30 min., and tert-butyl side chain protecting groups were removed by treatment 
with 25% tr if luoroacetic acid in methylene chloride for 30 min. The pelleted beads were washed with methylene chloride 
(2x), ethanol (2x), and TBS (1x). 

H. R-Phycoerythrin Staining of Bio tinylate d L - T yr-(Gly /L -P ro )- Q ly- U -P he-l-leu Be ads ( M ixture of S E Q ID N Q;12 and 

SEQ ID NO:13) 

Biotinylated L-tyrosine beads from (G) above were suspended in TBS (0.5 ml) and treated with 10 ^l of R-phyco- 
erythrin-avidin conjugate (Molecular Probes) for 30 min. Pelleted beads were washed with TBS (5x). 

I. Co-couDlina of L-Proline and Biotin to Mixture of L-Pro-Gly-L-Phe-L-Leu and FITC labelled Gly-Gly-L-Phe-L-Leu 
Beads (Mixture Of S5Q ID NQ;15 and SEQ ID NQ;14) 

5 mg of a mixture of L-Pro-Gly-L-Phe-L-Leu and FITC labelled Gly-Gly-L-Phe-L-Leu beads were treated with a 
solution containing Fmoc-L-proline pentafluorophenyl ester (48 mg, 95 nmol), N-hydroxysuccinimidobiotin (1.7 mg, 5 
jimol), and diisopropylethylamine (1 7 jil, 1 00 jimol) in dry DMF (1 ml). After vortex treatment for 3 hours, the beads were 
washed with DMF (2x), ethanol (2x), methylene chloride (2x), and DMF (1x). Fmoc deprotection was effected by treat- 
ment with a 20% solution of piperidine in DMF for 30 min., and by way of control, the beads were treated with 25% tri- 
f luoroacetic acid in methylene chloride for 30 min. The pelleted beads were washed with methylene chloride (2x), 
ethanol (2x), and TBS (1x). 

J. Tri-Color Staining of Biotinylated L-Pro-(Gly/L-Pro)-Glv-L-Phe-L-Leu Beads 

Biotinylated L-proline beads from (i) above are suspended in TBS (0.5 ml) and treated with 20 jnl Tri-Color: strepta- 
vidin conjugate (Caltag Labs) for 30 min. Pelleted-beads are washed with TBS (5x). 

K. Selection of Beads Containing Pep tide Lioands for Monoclonal Antibody 3E7 

Monoclonal antibody 3E7 was raised against the opioid peptide beta- endorphin. The binding specif icity of MAb 3E7 
has been well characterized by solution assays with chemically synthesized peptides. The equilibrium binding con- 
stants (Kd) of the peptides considered here are as follows: YGGFL is 6.6 nM; and YPGFL, PPGFL, and PGGFL are 
each >1 mM; thus, only the peptide YGGFL shows appreciable affinity for the antibody. 

A mixture of beads containing either YGGFL, YPGFL, PGGFL, or PPGFL and their respective tags (see above) are 
added in phosphate buffered saline (PBS) containing monoclonal antibody 3E7 that has been previously conjugated to 
colloidal superparamagnetic microbeads (Miltenyi Biotec, West Germany). After a 16h (hr) incubation at 4 °C. beads 
which bind the 3E7 antibody are selected using a high strength magnet. The selected beads are then analyzed by flow 
cytometry. Analysis of the selected beads reveals that they contain both fluorescein and R-phycoerythrin, indicating that 
only beads displaying the peptide YGGFL are selected by the 3E7 antibody. 

EXAMPLE 2: SYNTHESIS ON GLASS BEADS OF 4 PENTAPEPTIDES TAGGED WITH OLIGONUCLEOTIDE IDEN- 
TIFIERS 

A. Synthesis of Identifier Oligonucleotides (IHINfl 

The oligonucleotide identifier tags (l)-(IV) have the sequences shown below. The regions complementary to the 5* 
and 3' PCR primers are underlined. The regions complementary to the step-specific sequencing primers are shown in 
lower case: there are two steps in this example. The monomer encoding region is shown in bold type: CT 7 encodes Gly, 
TCT 6 encodes L-Pro, and TTCT 5 encodes L-Tyr in this case. Thus oligos (l)-(IV) code respectively for Gly in position 2, 
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on 5**crncnmaBacnriaa=i^^ 

m ZJt^rTT^cccrcicrecicrcTCcccmctncctttc 
crrrrcTTT^cTCCCTOT^ 3) 



ctctd 



CCraccrcrcrcrcE 



20 



(IV) *" B1B ^£^^ 

, q rH . C HF(CHo)d-NH-Biotin]-CH 2 -0-P0 2 - 

pnospMramKK* The N •^~»^ 0 S«ly N-WT^-»>~«^«^™J2 LLereeredeevedlromlhe 
iJSnu (B2) phosphorate. "ITtSi* promoted O-""* 1 ' 1 1 *°*? < tf^ 0 ,oE «. P-"* 1 W 

The primers used for PCR ana seq ^.u* 7-28 of SEQ ID NO:1) 

3- PGR Primer S'-B^tin-GG^GAAG^^ ^ 3 , (SEQ , D ^ 
Step #1 Sequencing Primer |,^SSS^AGAAAGGG-3' (SEQ iD NO T) 
Step #2 Sequencing Primer 5 -AGGAAAQfc./*^ 



5-_BlB2-CnTCT 



30 



35 



50 



55 



Step 7F£ ot^uci I*,.. - 



18 



EP 0 773 227 A1 

C. Preparation of L-Pro-Glv-L-Phe-L-Leu Beads Bearing Identifier Oliao (II) 

5 mg of Gly-L-Phe-L-Leu beads are treated as in (b) above, substituting Fmoc-L-Pro-OH and Oligo (II) for Fmoc- 
Gly-OH and Oligo (I), respectively. 

5 

D. Preparation of(Oteu)-L-Tvr-fGlv/L-Pro)-Glv-L-Phe-L-Leu Beads Bearing Identifier Oliaos (III and t/lh 

Beads from (b) and (c) are pooled and divided into two equal portions. One portion is treated as in (b), substituting 
Fmoc(OtBu)-L-Tyr-OH and Oligo (III) as appropriate. 

10 

E. Preparation of L-Prp-(Oly/U-PrQ)-QlY-L-Phe-L-Leu Bea^s Bearing Identifier Qliqos (IV and VII) 

The second pool is heated as before, substituting Fmoc-L-Pro-OH and Oligo (IV) as appropriate. 

15 F. Reconstitution and Deprotection of the Peptide Library 

Beads from (d) and (e) are pooled, and the phosphate, amino acid side-chain, and nucleotide exocyclic amino pro- 
tecting groups are removed as follows. A one hour treatment with a 1 :2:2 mixture of thiophenol: triethylamine: p-dioxane 
is followed by washing the beads with methanol (2x) and then methylene chloride (2x), and then the beads are treated 
20 for 5 min. with 95:5 trif luoroacetic acid: ethanedithiol. After a wash with methanol (3x), the beads are treated at 55 °C 
with 1 :1 ethylenediamine: ethanot for 1 hour and then washed first with ethanol (2x) and then with PBS (2x). This col- 
lection of beads constitutes the library and contains approximately equal quantities of the 4 immobilized peptides 
YGGFU YPGFL, PGGFL and PPGFL. Additionally, each bead carries two distinct 113 bp oligonucleotide sequences 
encoding the identities of both the first and second amino acids of the peptide on that bead. 

25 

G. PGR Amplification of Oligonucleotide Identifier Tag 

After a FAC sort of affinity purified beads into individual 0.5 mL polypropylene tubes, 25 \s\ of TBS containing 0.1 
ug salmon sperm DNA (as carrier) are added together with 25 ul of 2X PCR Amplification Buffer (PECI) to each tube. 

30 The 2X buffer contains: 100 mM KCI; 20 mM Tris-CI, pH 8.4, 20 degrees C; 6 mM MgCI 2 ; 0.4 mM dNTP's; 1 \iM of 5' 
PCR primer; 1 uM of 3' PCR primer; and 100 units/ml Taq DNA polymerase. 

After buffer addition, the sample is covered with 50 ul of mineral oil and transferred to an automated thermal cycler. 
In the thermal cycler, the samples are heat denatured at 95 °C for 2 min, and then cycled 35 times through 3 steps: 
95°C /30 sec., 60°C/1 min., 72 °C /1 min., which steps are followed by an incubation at 72 °C for an additional 5 min. 

35 and then the tubes are cooled and held at 1 5 °C until ready for processing on streptavidin beads. The mixture is heated 
to 95 °C to denature the strands, and the biotinylated purine strand and excess 3* PCR primer are removed by addition 
of streptavidin-coated beads. The tubes are centrifuged at 2005 for 5 min. The supernatant is used in the sequencing 
reactions, as described below. 

40 H. Sequencing of PCR Amplified Oligonucleotide Tags 

The amplified oligonucleotides from individual bead isolates are sequenced in a pair of reactions (using ddA or ddG 
as chain terminators) with either the Step #1 -specific or the Step #2-specific sequencing primers. 

To anneal the template and primer, for each set of two sequencing lanes, a single annealing and subsequent labe- 
45 ling reaction is run by combining 8.5 uJ of sequencing primer (conc.= 0.25 pmol/ul), 1 .5 ul Sequenase™ 5X sequencing 
buffer (200 mM Tris HCI. pH 7.5; 100 mM MgCfe; and 250 mM NaCI), and 10 ^l of template DNA from the amplification 
supernatant above. The samples are heated for 2 minutes at 65 °C and allowed to cool slowty to room temperature 
(approx. 10 minutes). 

The labeling reaction is performed as follows. Sequenase™ (v2.0) is diluted 1 : 20 with TE (10 mM Tris HCI, pH 7.5; 
50 and 1 mM EDTA), and a labeling cocktail containing a 2 : 3.5 ratio of diluted enzyme to labeling mix (i.e., a 4 : 2 : 1 mix- 
ture of 150 nM dGTP, 0.1 M dithiothreitol. alpha-^S-dATP, >1000 Ci/mmol) is prepared. About 5.5 ul of the cocktail are 
incubated with 10 uJ of annealed template/primer (from (i)) at 25 °C for 5 min. 

The termination reactions are performed as follows. 6 ul of labeling reaction mixture are added to 5 \x\ of each of 
the appropriate ddXTP termination reaction mixes (i.e., 80 uM dGTP, 80 uM dATP, 50 mM NaCI, and 8 uM ddGTP or 8 
55 uM ddATP). After incubation at 37 °C for 5 min., about 8 ul of Stop Solution (95% formamide, 20 mM EDTA, 0.05% 
bromophenol blue, and 0.05% xylene cyanol) are added to each of the termination reactions. 

The sequencing gel is comprised of 6% total acrylamide (19:1 acrylamide/bis), 0.09 M Tris base. 0.09 M boric acid, 
1 mM EDTA, and 7 M urea. The gel is polymerized by addition of 1.9 *d of 25% ammonium persuHate per ml and 0.72 
Ml of TEMED per ml of above gel solution. The gel is allowed to polymerize at least one hour and is prerun at least 20 
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roinu.es prior to sample .oading. Gel plates are then maintained between 40 and 50 »C pnor to 

Reasons are heated to 85-95 °C for 2 minutes prior to loading, and the gel .s run unt.l the bromophenol blue dye 
r „,h^r™ ofthe ael The sequences of interest run between the bromophenol and xylene cyanol markers. 
^m^SXfri AS ^sequence o, monomers in the oligomers attached to the bead ,s contained in 
the DNA sequence information. 

EXAMPLE 3: PARALLEL SYNTHESIS OF PEPTIDES AND OLIGONUCLEOTIDE TAGS ON CARBOXYL BEADS 
A. S ynthesis of Phosohor amiriites fIVflVI 

The 3-fallvl NNdiisopropyl-phosphoramidites) of 5'-DMT derivatives of: (1) ^Kallyloxy)ca^l-7-de^a-! Z- 
deoSenoS ^rKKca^y.-2 -deoxy-cytidine; (3) ^^^^^f^^^ 
aS W th^nSne (see Rgure 6) are prepared according to the procedures of Hayakawa el *. J Amer Ohem. Soc. 

112 : 1691-1696 

B P°ri"flt^i" 1 P-arhnxvl Bea rfr Wit* " famine Unker 

♦^h* hlad ^e^Diisooropylethylamine (DIEA, 54 nl. 0.30 mmol) was added, and the suspens.on was vortexed for 1 
to the bead pellet. ^ e ^'^ e J^_ amj ^ e (20 ^ 0 .10 mmol) was then added, and the reaction was vor- 

S^^^S- with 9:1 DMF/water (1.0 ml) and vortex* for 15 minutes. The beads were then pel- 
leted. the supernatant decanted, and the beads washed with DMF (3 x 1 .0 ml ). 

r. Attarhinn Peptide and Oligonucleotide Syn thesis Linkers 

1 00 mg of the beads are treated with a mixtore of *" F ~"^^ 
(DMD-hydroxybutyric acid (0.1 ^mol) in the presence of HBTU (0.1 mmol . HO« (0.1 nrnjj and £^ <* 1 ™ 
7run nMF I 1 0 m n After vortex treatment for 30 minutes, the reaction mixture is diluted with DMF (1 .0 rnl^the 
bS2e£ decanted. The beads are washed w* , DMF f or 1X » m^The coupl.ng procedure 
islnen repeated with fresh reagents, and the beads are pelleted and washed as descnbed above. 

n Buildin g a a- PCR Prim inn Site on the Hydroxy Linkers 

Th» naraiiel assemblv of oligonucleotide-tagged peptides on beads is illustrated in Figure 8. A PCR primingsite of 
20 £*^e^^^™<™ Note that a., reagents used are anhydrous, and " 
2iS3£?£5 arS About 1 0 mg of the beads are subjected to an eight-step reaction sequence £ ■ ? °" 

S eTght are repeated from one to 25 times to assemble a PCR priming site of up to 25 nucleotides. 
p (-.nii plinq of First A mino Acid to Amino Linkers. 

Peptide and nucleotide couplings may be alternated, as illustrated ^Figure 8. To couple an ™™ ^< 0 ^ J* 
the Fmoc group is first removed from the beads by treatment with 30% pipendine in DMF for 60 mm. The , beads are 
wash* 3 S with DMF The beads are then treated with a solution containing the appropriate amino acid (0.1 M) 
hIt^O 1 M) HoS (oTm). and DIEA (0.1 M) in 9:1 CH 2 CI 2 :DMF for 30 min. The coupl.ng is then repeated with fresh 
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reagents for a further 30 min. and the beads are washed with DMF (3x) and then with MeCN (3x). 

F. Construction of First Oligonucleotide "Codon" 

A "codon" of about 3 to 5 nucleotides uniquely representing the identity of the first amino acid is then built at the 5' 
end of the oligonucleotide chain using the 8- step coupling cycle in procedure (d) above. 

G. Coupling of Subsequent Amino Acids and "Codon" Construction 

The methods of procedures (e) and (f) are then repeated using the appropriate amino acid and nucleotide building 
blocks until the desired peptide and the oligonucleotide coding region are completely assembled. 

H. Construction of a 5' PCR Priming Site 

The 8-step coupling cycle of procedure (d) is used to build a 20-25 nucleotide PCR priming site on the 5' terminus 
of the oligonucleotide chains. 

I. Deprotection of the Oligonucleotide and Peptide Chains 

The fully assembled peptide and oligonucleotide chains are deprotected as follows. The amino-terminal Fmoc 
groups are removed by treatment with 30% piperidine in DMF and then a wash with THF (3x). To remove the allylic pro- 
tecting groups, the beads are treated with a THF solution containing tris(dibenzylideneacetone) dipalladium-chloroform 
complex (0.02 M), triphenylphosphine (0.2 M), and 1 :1 n-butylamine/formic acid (1 .2 M) at 50°C for 30 min. and the pel- 
leted beads are washed with THF. The beads are washed with 0.1 M aqueous sodium N.N-diethyldrthiocarbamate and 
then water to remove traces of palladium. The amino acid protecting groups are then removed by treatment with 95:5 
TFA/water for 30 min. "Scavenger" reagents such as 1 ,2-ethanedithiol and thioanisole may also be included in this 
acidic deprotection medium (e.g., 2% of each by volume). Finally, the fully deprotected beads are washed with aqueous 
buffer and are ready for interaction with a biological receptor. 

EXAMPLE 4: LIBRARY PREPARATION AND SCREENING 

In this example, two populations of amine derivatized beads were labeled with oligonucleotides possessing base 
sequences uniquely characteristic of each bead population. The population labeled with an oligonucleotde 95 bases in 
length (95 mer) was subsequently coupled to the peptide YGGFL The population of beads labeled with an oligonucle- 
otide 1 10 bases in length (110 mer) was coupled to phenyalanine (F). The beads were then mixed in the ratio of twenty 
F/110 mer beads for each YGGFL/95 mer bead and stained with a fluorescentfy labeled antibody 3E7 that binds the 
peptide YGGFL with high affinity. Individual f luorescentfy stained beads could then be sorted by FACS directly into PCR 
tubes. After PCR, 5 of 6 fluoresently stained beads gave rise to a fragment of amplified DNA 95 bp long. PCR of the 
remaining single bead gave rise to small DNA fragments, possibly being primer dimer. 

The oligonucleotides used in this experiment are the two tags, two PCR primers, and one sequencing primer. The 
same PCR and sequencing primers were used for the two tags. The two tags differ in their sequence and length. Both 
tags were composed of the bases 7-deazaA, C, and T. 

The 95 mer tag has the sequence: 

CCA CTC ACT ACC ACT CTA CTA TA A CC A CCC CTT CCT ATT CCA AAA TTA 
CAA Act tat etc aae tac ate tCA C AC TC A CTC ATC TCT AC A TCT AC (SEQ ID 
NO:8) 

The 110 mer tag has the sequence: 

CCA CTC ACT ACC ACT CTA CTA TAA C CC TCC CCT ATT CCA AAA TTA CAT 
CCT ATT CCA AAA TTA CAA Act tat etc aac tac ate t CA CAC TCA CTC ATC TCT 
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ACATCTAC (SEQ ID NQ9) 

M rh tampt thfi underlined seauences represent PCR primer binding sites: the sense primer is at the 5'-end, and 
the anti-sense primer is at the 3'-erxi Also, for each target the small case sequence represents the sequencing primer 
binding site. 

A. Bead Preparation 

Beads were purchased from Bang's Laboratories (979 Keystone Way, Carmel, IN 46032) and are composed of car- 
bonated polystyrene (4.5 urn average diameter). These beads were subjected to diamine derivitization by the process 
described below. 

Beads (200 mg) were treated with 1.0 ml of 1 N HCI and vortexed 1 5 min. The beads were pelleted, decanted, and 
washed with three times (3x) with 1 .0 mL of water each wash and then washed 3x with 1 .0 ml of DMF each wash. To 
the washed pellet was added 2-(1 H-benzotriazol-1 -yl)-1 .1 ,3.3-tetramethy!uronium hexaf luorophosphate (HBTU; 38 mg. 
0 1 mmole), 1 -hydroxybenzotriazole hydrate (HOBT; 15 mg, 0.1 mmole), 500 »l of methylene chloride, and 54 »i1 of 
diisopropylethylamine (DIEA; 0.3 mmole). After vortex treatment for 2 min., 20 nl of diamine (4,9<iioxa-1,12-dodecan- 
ediamine- 94 nmole) were added. After vortex treatment for 30 min, 1 .0 ml of DMF was added, and the beads were pel- 
leted by centrifugation. The supernatant was removed, and 1 .0 ml of 10% water in DMF was added. The beads were 
vortexed an additional 15 min. and finally washed 3x with 1 .0 ml of DMF each wash. 

B Oiigoniinlftotide Attachment 

Two different target oligonucleotides were employed in this experiment: a 95 mer and a 110 mer. These oligonucle- 
otides were composed of the bases cytidine, thymidine, and 7-deaza adenosine. The oligonucleotides were synthe- 
sized with a primary amino group on the S'-terminus (5'-amino modifier C-12. Glen Research). Lyoph.hzed 
oligonucleotide (600 pmole) was dissolved in 5 ^l of 0.5 M Na-phosphate. pH 7.7, and the solution was treated with 10 
ul of 0.2 M disuccinimydylsuberate (DSS). The reaction proceeded 10 min., and then 85 ^l of ice-cold water were 
added Unreacted DSS was removed by centrifugation. The supernatant was passed through a G25 spin column that 
had been equilibrated with water. The eluant was immediately frozen and lyophilized to isolate the 5'-N-hydroxysucci- 
namkJe ester of the oligonucleotide. ^ « ^ , , 

This activated oligonucleotide was dissolved in 50 of 0.1 M Na-phosphate, pH 7.5. which contained 0.1 mg/mL 
of sonicated salmon sperm DNA. This solution was added to 10 mg of diamine derivitized beads. After vortex treate- 
ment for 3h(hr)the beads were washed 2x with 0.4 mL of 0.1 M Na-phosphate, pH 7.5, each wash, and then washed 2x 
with 0.4 ml of 0.1 N NaOH. Finally, the beads were washed with 3x with 0.4 ml of pH 7.5 buffer. 

n. Peptide Attachment 

To Boc-YGGFL or Boc-Phe (Boc - t-butoxy-carbonyl amine protecting group; 0.1 mmole) was added HBTU (0.1 
mmole), HOBT (0. 1 mmole), 1 .0 ml of 10% DMF in methylene chloride, and DIEA (0.3 mmole). After vortex treatment 
to dissolve the solids, 0.4 m I of the peptide solution was added to 3 mg of oligonucleotide-labeled beads. The solution 
containing Boc-YGGFL was added to beads labeled with the 95 mer, and the solution containing Boc-Phe was added 
to beads labeled with the 1 1 0 mer. The reaction mixtures were vortexed 30 min. and then diluted with DMF, centnfuged, 
decanted, and the bead pellets washed with 3x with 1 .0 ml of THF. The Boc protecting groups were removed by treating 
the beads with 0 4 ml of 95% trifluoroacetic acid for 10 min. The deprotection reaction was then diluted with THF, cen- 
trifuged. and decanted, and the beads were washed with 3x with 1.0 ml of DMF each wash. Finally, the beads were 
washed with 3x with 0.5 ml of 0.1 M Na-phosphate. pH 7.5. each wash and stored as a slurry (10 mg/ml). 

D. Mixing. Stai ning, and Sorting 

The beads coupled with the 95 mer and YGGFL were mixed with the beads that were coupled to the 1 10 mer and 
F in the ratio of 1 20. Thus. 0.1 mg of 95 mer/YGGFL beads (2 million beads) were mixed with 2.0 mg of the 110 
mer/Phe beads (40 million beads). The mixture was suspended in blocking buffer (PBS. 1% BSA. and 0.05% Tween- 
20) and incubated at room temperature for 1h The beads were next pelleted by centrifugation and resuspended in a 
solution of an FITC-labeled monoclonal antibody 3E7 that binds the peptide YGGFL (1 ug/ml). The suspension was 
incubated 0.5 on ice and then centrifuged to isolate the bead pellet. 

The beads were resuspended in PBS for delivery into the fluorescence activated cell sorting (FACS) instrument 
(Becton Dickinson FACSORT Plus). Beads that had bound to the f luorescently labeled antibody were identified by their 
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acquired fluorescence, and fluorescent beads were isolated by sorting into PCR tubes. One, ten, or one hundred fluo- 
rescent beads were sorted into each PCR tube. In an analogous manner, non-fluorescent beads were also sorted into 
PCR tubes. 

s E. Amplification of Sorted Beads 

To each PCR tube containing a bead or beads was added 25 *iL of PCR buffer (20 mM Tris-HCI, pH 8.7; 10 mM 
KCL; 10 mM (NH4) 2 S0 4 ; 2 mM MgCI 2 ; 0.1% Triton X-100; 0.14 mg/ml BSA; 200 um dATP; 200 \itr\ dGTP; 200 urn 
dCTP; 200 \im dTTP; 2 um primer #1 ; 2 ^im primer #2; and 0.5 units of Pfu DNA polymerase). Reactions were sub- 
10 jected to 40 cycles of 95° C for 0.5 min., 55° C for 1 min., and 72°C for 1 min. 

Gel loading dye (2 nL) was added to 10 \i\ of each PCR, and the sample was run on a 2% low melting point agarase 
gel. DNA fragments were visualized by staining with ethidium bromide and exposure to UV light. Five of six of the tubes 
containing single flourescent beads gave rise to DNA fragments 95 base pairs in length, confirming that these beads 
were coupled to YGGFL and not F. Tubes containing 10 or 100 fluorescent beads also gave rise to 95 mer DNA frag- 
15 ments. Conversly, none of the tubes containing 1 , 10, or 100 non-fluorescent beads gave rise to 95 mer fragments. 

There were, however, anomalous amplification products smaller than 1 1 0 bp from amplification of the tags of non- 
fluorescent beads. These anomalous products may have arisen through the use of unprotected oligonucleotide tags in 
this example, which may have allowed the free exocyclic amines to couple to the F amino acid, thereby rendering the 
tag subject to anomalous amplification. This problem would not have affected the 95 mer tag to the same extent, 
20 because YGGFL would be less reactive with the exocyclic amines than F. 

EXAMPLE 5: LIBRARY SYNTHESIS AND SCREENING 

This example is illustrated schematically in Figure 9. Briefly, a single population of amine derivatized beads (pre- 
25 pared as described in Example 4) was coupled to glycine. The population was then divided into two equal parts, and 
each part was labeled with a characteristic oligonucleotide that would uniquely identify the bead subpopulation. The 
subpopulation that had been labeled with an oligonucleotde 95 bases in length (the 95 mer described in Example 4) 
was subsequently coupled to the peptide YGGFL. The population of beads that had been labeled with an oligonucle- 
otide 110 bases in length (the 1 10 mer described in Example 4) was coupled to the peptide FLFLF. (SEQ ID NO:16) 
30 The beads were then mixed in the ratio of twenty FLFLF/1 10 mer beads for each YGGFL795 mer bead (i.e., 20:1) and 
stained with a f luorescently labeled antibody (3E7) that binds the peptide sequence YGGFL with high affinity. Individual 
fluorescently stained beads and unstained beads were sorted directly into PCR tubes. Upon PCR, all the fluoresently 
stained beads gave rise to a fragment of amplified DNA 95 base pairs in length, and all the unstained beads gave rise 
to a fragment 110 base pairs in length. 

35 

A. Peptide Coupling Step #1 

To Fmoc-Gly (Fmoc = 9-fluorenylmethoxycarbonyl amine protecting group; 0.1 mmole) was added HBTU (0.1 
mmole), HOBT (0.1 mmole), 1.0 ml of 10% DMF in methylene chloride, and DIEA (0.3 mmole). After vortex treatment 
40 to dissolve the solids, 0.4 mL of the solution containing the activated amino acid was added to 50 mg of diamine deri- 
vatized beads. The reaction mixture was vortexed 30 min. and then diluted with DMF, centrifuged, decanted, and the 
bead pellet washed twice with 1.0 ml of DMF. The coupling reaction was then repeated. The beads were then treated 
with 1 .0 m I of 30% piperidine in DMF with vortexing for 1h to deprotect the glycine amino group. 

45 B OlignmiriPntiHA I shying 

Two different target oligonucleotides were employed in this experiment: the 95 mer and 110 mer described in 
Example 4. Half of the bead sample described above (25 mg) was labeled with the 95 mer, and the other half was 
labeled with the 1 10 mer. These oligonucleotides are composed of 2*-deaxy-cytidine, thymidine, and 2*-deoxy-7-deaza- 

50 adenosine. The oligonucleotides were synthesized with a primary amino group on the 5 -terminus (MMT-Cl2-Ami- 
nomodifier, Clonetech Laboratories, Inc.). Lyophilized oligonucleotide (1.5 nmole) was dissolved in 10 ^ of 0.5 M Na- 
phosphate, pH 7.7. and the solution was then treated with 20 of 0.2 M disuctinimydylsuberate (DSS). The reaction 
proceeded 10 min., and then, 70 ul of ice-cold water were added. Unreacted DSS was removed by centrifugation. The 
supernatant was passed through a G-25 spin column that had been equilibrated with water. The eluant was immedi- 

55 ately frozen and lyophilized to isolate the 5'-N-hydroxysuccinamide ester of the oligonucleotide. This activated oligonu- 
cleotide was dissolved in 100 \i\ of 0.1 M Na-phosphate, pH 7.5, which contained 0.1 mg/ml of sonicated salmon sperm 
DNA. This solution was added to 25 mg of glycine-coupled beads. After vortex treatment for 3h the beads were washed 
twice with 0.4 m of 0.1 M Na-phosphate. pH 7.5. and twice with 0.4 ml of 0.1 N NaOH. Finally, the beads were washed 
three times with 0.4 ml of pH 7.5 buffer. 
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Xo Boc-YGGFL or Boc-FLFLF (Boc = ^^^SSSS^SSl ZZttS+Z™ 
(0.02 mmole). HOBT (0.02 mmole). 0.190 mof 10% DMF .n ^""JJJJ ethylene chloride. An aliquot of this 
treatment to dissolve the solids, the solution ^2S^2!il£S£Sd £eads (25 mg). The solution contain- 
solution (0.345 ml ) was added to the 

ing Boc-YGGFL was added to beads tebeled «^ th ° 95 ^^ 30 min . and then diluted with DMF. centrrfuged, 
beads labeled with the 1 10 mer. The re KMnn^ esw« e ^xeo we re removed by 



n Miifini^. Staining SPd Sorting 



^ beads coupled to *e 1 10 mer ^ ^ 

YGGFL in the ratio of 20 .1. Thus. 0.1 mg of 95 mer/YQGFL beads (2 m«»o j 0.05% Tween-20) 

mer/FLFLF beads (40 million beads) ^^^^^^y^i^iation and resuspended in a solu- 
and incdbated at room temperature ^V^^^S^Se peptide sequence YGGFL (1 ng/m. ). The sus- 
tion of an FITC-labeled monoclonal antibody C 3 ^"^** 59 " „„ et 

pension was incubated 0.5 h on ice and ton "M*""* ££j££'2£ |Bd ce „ Mg (FA CS) instrument 
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Toeach PCRtubecorrtaining ab^ 200 
10 mM (NH 4 ) 2 S0 4 . 2 mM MgCI 2 . 0.1% ^^g^J^^SSSl DNA polymerase). Reactions were 
uM dTTP, 2 |iM of each primer (as* -escrtoedin I Eot*m *>.^£ 5 ™* * — Ge , ^ dye (2 , 0 was added to 
subjected to 40 cycles of 95°C for 0.5 m.n.. S5 C^ mm.^and T^ ^ ^ by 

cert^lation^roduced only fragments 1 10 base pa.rs .n length (see F.gure 11). 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: DOWER, WILLIAM J 
BARRETT, RONALD W 
GALLOP, MARK A 
NEED ELS, MICHAEL C 

(ii) TITLE OF INVENTION: METHOD OF SYNTHESIZING DIVERSE 
COLLECTIONS OF OLIGOMERS 

(iii) NUMBER OF SEQUENCES: 16 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: TOWNS END AND TOWNSEND 

(B) STREET: 1 MARKET PLAZA, STEUART TOWER, SUITE 2000 

(C) CITY: SAN FRANCISCO 

(D) STATE : CALIFORNIA 

(E) COUNTRY: USA 

(F) ZIP: 94105 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1,25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(Viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Smith, William M. 

(B) REGISTRATION NUMBER: 30,223 

(C) REFERENCE/ DOCKET NUMBER: 11509-36-1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415-543-9600 

(B) TELEFAX: 415-543-5043 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cONA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
CTTTCTTCCT CTCCCTCTTT TCTCCTCTCT TTTTTTCTCC TTCTTTTTTT CTCTCCCTCT 60 
CTCCTCTCTC CCCTTTCTCT CCTTTCCTCC TCTCCTCTCT CTCTTCTTTC C 111 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cONA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
CTTTCTTCCT CTCCCTCTTT TCTCCTCTTC TTTTTTCTCC TTTCTTTTTT CTCTCCCTCT 60 
CTCCTCTCTC CCCTTTCTCT CCTTTCCTCC TCTCCTCTCT CTCTTCTTTC C 111 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CTTTCTTCCT CTCCCTCTTT TCTCCTCTTT CTTTTTCTCC TTTTCTTTTT CTCTCCCTCT 60 
CTCCTCTCTC TCTTCCTTTC CCCTCTCTCT CTCCTCTCCT CTCTCTCTTC TTTCC 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 115 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EONESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CTTTCTTCCT CTCCCTCTTT TCTCCTCTTC TTTTTTCTCC TTTCTTTTTT CTCTCCCTCT 60 
CTCCTCTCTC TCTTCCTTTC CCCTCTCTCT CTCCTCTCCT CTCTCTCTTC TTTCC 115 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 5: 
GGAAAGAAGA GAGAGAGGAG AGG 23 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
^0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

AGAGAGGGGA AAGGAAGA 18 
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(2) INFORMATION FOR SEQ ID HO-.7: 

(D) TOPOLOGY: linear 
MOLECULE TYPE: CDNA 
(xi , SEQUENCE DESCRIPTION: SEQ ID HO:7: 
AGGAAAGGAG AGAAAGGG 

(2) INFORMATION FOR SEQ ID HO:8: 

tvX TYPE: nucleic acid 
( ( C) STRANDEDNESS: single 
» (d) TOPOLOGY: linear 

(ii ) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ^S: ^ 
" CCACTCACTA CCACTCTACT ATAACCACCC CTTCCTATTC 

AACTACATCT CACACTCACT CATCTCTACA TCTAC 

(2) INFORMATION FOR SEQ ID NO:9: 

fBl TYPE: nucleic acid 
( ,C1 STRANDEDNESS:, Single 
S3 TOPOLOGY: linear 

MOLECULE TYPE: cDNA 

cco«» cccrcx^ xt^ccccc cer*^ c 

CTCTCACC T»CT«T 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gly Gly Phe Leu 
1 



(2) INFORMATION FOR SEQ ID NO: 11: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
( D) TOPOLOGY : linear 



45 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Pro Gly Phe Leu 
1 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Tyr Gly Gly Phe Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 

Tyr Pro Gly Phe Leu 
1 5 



INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Pro Gly Gly Phe Leu 
1 5 



INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 

Pro Pro Gly Phe Leu 

1 5 
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(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Phe Leu Phe Leu Phe 
1 5 



Claims 

1. A process for preparing a new pharmaceutical drug or diagnostic reagent, which includes the step of screening 
against a ligand or receptor a library of different synthetic compounds, which compounds are obtainable by synthe- 
sis in a component by component fashion which links each compound to one or more identifier tags which enable 
subsequent identification of reactions through which said components were incorporated and consequent deduc- 
tive structural identification of said members. 

2. A process of claim 1 , wherein the library comprises a plurality of different members, each member comprising an 
oligomer composed of a sequence of monomers linked to one or more identifier tags identifying the sequence of 
monomers in said oligomer. 

3. A process of claim 2, wherein said linkage between said oligomer and said identifier tag comprises a solid support. 

4. A process of claim 2 or claim 3, wherein said identifier tag is attached to said oligomer. 

5. A process of any one of claims 2 to 4 , wherein the library has about 1 0 6 different members. 

6. A process of any one of claims 2 to 5, wherein said oligomer is a peptide or an oligonucleotide. 

7. A process of any one of claims 2 to 6, wherein said identifier tag is a fluorescent marker. 

8. A process for preparing a new pharmaceutical drug or diagnostic reagent, which includes the step of screening 
against a ligand or receptor a tagged synthetic oligomer library produced by synthesizing on each of a plurality of 
solid supports a single oligomer sequence and one or more identifier tags identifying said oligomer sequence, said 
oligomer sequence and identifier tags synthesized in a process comprising the steps of: 

(a) apportioning said supports among a plurality of reaction vessels; 

(b) exposing said supports in each reaction vessel to a first oligomer monomer and to a first identifier tag; 

(c) pooling said supports; 

(d) apportioning said supports among a plurality of reaction vessels; 

(e) exposing said supports to a second oligomer monomer and to a second identifier tag monomer; and 

(f) repeating steps (a) through (e) from at least one to twenty times. 

9. The use of a solid support in pharmaceutical drug or diagnostic reagent identification, said solid support comprising 
a first particle attached to a second particle, said first particle linked to an oligomer and said second particle linked 
to an oligonucleotide identifier tag, and wherein said oligomer is other than an oligonucleotide. 

10. A process for preparing a new pharmaceutical drug or diagnostic reagent, which includes the step of screening 
against a ligand or receptor an oligomer library which is obtainable by a process comprising: 

recording each step in a sequence of oligomer monomer additions in the synthesis of an oligomer library, the 
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method comprising adding an identifier tag in conjunction with the addition of each monomer and pedormingat 
Sto cX of monomer and tag addition, thereby forming a series of identifier tags .dentif y.ng said ohgomer 
sequence. 

11 A process of claim 10. wherein, in said process whereby the oligomer library is obtainable, after the addition of a 
first identifier tag. subsequent identifier tags are attached to the terminus of preexisting tag. 

12 A process of claim 10 or claim 1 1 . wherein, in said process whereby the oligomer library is obtainable, a peptide 
and an oligonucleotide on a solid support are synthesized, said method comprising: 

(a) preparing a Afunctional solid support containing a first type of active site blocked with a first type of protect- 
ing group and a second type of active site blocked with a second type of Pjotec^ group. 

(b) reacting said solid support with an activator to remove said first type of protecting group thereby exposing 

said first type of active site; . 

(c) coupling an oligonucleotide monomer or an oligonucleotide to said first type of active site, 

(d) reacting said solid support with an activator to remove said second type of protecting group thereby expos- 
ing said second type of active site; 

(e) coupling a peptide monomer or peptide to said second type of active site; and 
(0 repeating steps (b) through (e) from one to twenty times. 

13. A process of any one of claims 1 to 8. 10. 11 or 12. wherein said drug or diagnostic reagent is developed from the 
result of said screening. 

14. Aprocossforpreparingapesticideorhe^^ 

or 1 3 modified by said screening and/or development being to produce a pesticide or herbrade. 

15 The use of a solid support in pesticide or hertoicide identification, said solid support comprising a first particle 
ISacrSd £ a sSond 5Se. said first particle .inked to an oligomer and sa« second particle ..nked to an ohgonu- 
cleotide identifier tag. and wherein said oligomer is other than an oligonucleotide. 

16. A pharmaceutical drug or diagnostic reagent obtained by a process of any one of claims 1 to 8, 10. 11. 12 or 13or 
identified in a use of claim 9. 

17. A pesticide or herbicide obtained by a process of claim 14 or identified in a use as claimed in daim 15. 
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